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MODULATION OF HUMAN SODIUM CHANNELS IN DORSAL ROOT GANGLIA 

FIELD OF THE INVENTION 

The present invention relates to a novel human tetrodotoxin-resistant sodium channel 
and related nucleotides, as well as screening assays for identifying agents useful in treating 
acute or chronic pain or other hyperexcitability states. This application is related to U.S. 
Provisional Apphcation 60/072,990, filed January 29, 1998; U,S. Provisional Application 
60/109,402 filed November 20, 1998; U.S. Provisional Application 60/109,666 filed on 
November 20, 1998 ; PCI International Application PCT/US99/02008 filed January 29, 1999 
and U.S. Patent Application 09/354,147 filed July 16, 1999, all of which are herein 
incorporated by reference in their entirety. 

BACKGROUND 
A. Sodium Channels 

Voltage-gated sodium channels are a class of specialized protein molecules that act as 
molecular batteries permitting excitable cells (neurons and muscle fibers) to produce and 
propagate electrical impulses. Voltage-gated Na^ channels &om rat brain are composed of 
three subunits, the pore-forming a subunit (260 KDa) and two auxiliary subunits, bl (36 
KDa) and b2 (33 KDa) that may modulate die properties of the a-subunit; the a subunit is 
sufficient to form a functional channel that generates a Na current flow across the membrane 
(Catterall, (1993) Trends Neurosci. 16, 500-506; Isom et al, (1994) Neuron 12. 1 183-1 194). 
Nine distinct a subunits have been identified in vertebrates and are encoded by members of an 
expanding gene family (Goldin (1 995) Handbook of receptors and channels (North, editor) 
CRC Press; Akopian et al, (1996) Nature 379, 257-262; Akopian et al, (1997) FEES Lett. 
400, 183-187; Sangameswaran et al., (1996) J. Biol. Chem. 271, 5953-5956) and respective 
orthologues of a number of them have been cloned from various mammaUan species including 
humans. Specific a subunits are expressed in a tissue- and developmentally-specific manner 
(Beckh et al, (1989) EMBO J. 8, 3611-3616; Mandel. (1992) J. Membr. Biol. 125, 193-205). 
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Aberrant expression patterns or mutations of voltage-gated sodium channel a-subunits 
underlie a number of human and animal disorders (Roden & George, (1997) Am. J. Physiol. 
273, H511-H525; Ptacek, (1997) Neuromuscul. Disord. 7, 250-255; Cannon, (1997) 
Neuromuscul. Disord. 7; 241-249; Cannon, (1996) Trends Neuiosci. 19, 3-10); Rizzo etaL, 

5 (1996) Eur. Neurol. 36, 3-12). 

Voltage-gated sodium channel a-subunits consist of four domains (D 1-4) of varying 
internal homology but of similar predicted structure, connected by three intracellular loops 
(Ll-3). The four domains fold to fomi a channel that opens to both the cytoplasm and the 
extracellular space via a pore. The pore opens and closes depending upon the physiological 

10 state of the cell membrane. 

Each domain consists of six transmembrane segments (SI -6) that allow the protein to 
weave through the membrane with intra- and extracellular linkers. The linkers of S5-S6 
segments of the four domains contain sequences that Une the pore of the channel, and a highly 
conserved subset of amino acids that acts as a filter to selectively allow sodium ions to 

15 traverse the channel pore into the cytoplasm, thus generating an electric current. The 
amphiphatic S4 segment, in each of the four domains, rich in basic residues repeated every 
third amino acid, acts as a voltage sensor and undergoes a conformational change as a result of 
the change in the voltage difference across the cell mraibrane. This in turn triggers the 
conformational change of the protein to open its pore to the extracellular Na* ion gradient. 

20 In most of the known voltage-gated sodium channel a-subunits the channels close and 

change into an inoperable state quickly (inactivate) within a few milliseconds after opening of 
the pore (activation); SNS-type channels, on the other hand, inactivate slowly and require a 
greater voltage change to activate. L3, the loop that links domains D3 and D4, contains a 
tripeptide which acts as an intracellular plug that closes the pore after activation, thus inducing 

25 the channel to enter the inactive state. After inactivation, these chaimels further undergo 
conformational change to restore flieir resting state and become available for activation. This 
period is referred to as recovery fiom inactivation (repriming). Different chaimels reprime at 
different rates, and repriming in SNS is relatively rapid. 
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Based on amino acid similarities, the voltage-gated sodium channel family has been 
further subdivided into two sub&milies (Felipe et al^ (1994) J. Biol. Chem. 269, 
30125-30131). Eight of the nine cloned channels belong to subfamily 1 . They share many 
structural features, particularly in their S4 transmembrane segments. However, some of them 
have been shown to have distinct kinetic properties of inactivation and repriming. Only a 
single channel of subfamily 2, also referred to as atypical channels, has been identified in 
human, rat and mouse tissues. This subfamily is primarily characterized by reduced numbers 
of basic residues in its S4 segments, and thus is predicted to have different 
voltage-dependence compared to subfamily 1. The physiological function of subfamily 2 
channels is currently unknown because its electrophysiological properties have not yet been 
elucidated. 

The blocking of voltage-gated sodium channels by tetrodotoxin, a neurotoxin, has 
served to functionally classify these channels into sensitive (TTX-S) and resistant (TTX-R) 
phenotypes. Two mammalian TTX-R channels have so far been identified, one specific to the 
cardiac muscle and to very limited areas of the central nervous system (CNS) and the second, 
SNS, is restricted to peripheral neurons (PNS) of the dorsal root ganglia (DRG) and trigeminal 
ganglia. Specific anuno acid residues that confer resistance or sensitivity to TTX have been 
localized to the ion selectivity filter of the channel pore. The SNS channel is also described in 
International Patent Application WO 97/01577. 

B. Role of Sodium Channels in Disease States 

Because different Na* channel a-subunit isotypes exhibit different kinetics and 
voltage-dependence, the firing properties of excitable cells depend on the precise mixture of 
channel types that they express. Mutants of the cardiac and skeletal muscle a-subunit have 
been shown to cause a number of muscle disorders. Some examples are as follows: A change 
of a single basic amino acid residue in the S4 of the skeletal muscle channel is sufficient to 
change the kinetic properties of this channel and induce a disease state in many patients. A 
tripeptide deletion in L3 of the cardiac channel, proximal to the inactivation gate, mduces a 
cardiac disorder called Long QT syndrome. A single amino acid change in the S5-S6 linker of 
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domain 1 of ScnSa, the region lining the pore of the channel, causes the mouse mutant 
"jolting*'. The total loss of this channel by a different mutation causes motor end plate '•med" 
disease in mice. This mutation is characterized by loss of motor neuron stimulation of the 
innervated muscle. 

C. Sodium Channels and Pain 

Axonal injury (injury to nerve fibers, also called axons) can produce chronic pain 
(termed neuropathic pain). A number of studies have demonstrated altered excitability of the 
neuronal cell body and dendrites after axonal injury (Eccles et al, (1958) J. Physiol. 143: 
11-40; Gallego et aL, (1987) J. Physiol. (Lond) 391, 39-56; Kuno & Llinas, (1970) J. Physiol. 
(Lond.) 210, 807-821), and there is evidence for a change in Na^ channel density over the 
neuronal cell body and dendrites following axonal injury (Dodge & Cooley, (1973) IBM J. 
Res. Dev. 17, 219-229; Titmus & Faber (1986) J. Neurophysiol. 55, 1440-1454; Semagor et 
a/., (1986) Proc. Natl. Acad. Sci. USA 83, 7966-7970). The expression of abnormal mixtures 
of different types of sodium channels in a neuronal cell can also lead to abnormal firing (Riz2X) 
et al^ (1996) Eur. Neurol. 36, 3-12), and can contribute to hyperexcitability, paresfliesia or 
pam. 

Recent studies on rat sensory DRG neurons have demonstrated a dramatic change in 
the expression profile of TTX-R and TTX-S currents and in a number of mRNA transcripts 
that could encode the charmels responsible for these currents in DRG neurons following 
various insults (Rizzo et a/., (1995) Neurobiol. Dis. 2: 87-96; Cummms et aL, (1997) J. 
Neurophysiol. 17, 3503-3514; Dib-Hajj etaL, (1996) Proc. Natl. Acad. Sci. USA 93, 
14950-14954). For example, it has been shown an attenuation of the slowly inactivating, 
TTX-R current and simultaneous enhancement of the rapidly inactivating, TTX-S Na^ currents 
in identified sensory cutaneous afferent neurons following axotomy (Rizzo et aL, (1995) 
Neurobiol. Dis. 2, 87-96). A loss of TTX-S, slowly repriming current and TTX-R current and 
a gain in TTX-S, rapidly repriming current in nociceptive (pain) neurons following axotomy 
(Cummins & Waxman (1997) J. Neurophysiol 17, 3503-3514), down-regulation of SNS 
transcripts and a simultaneous up-regulation of a-III Transcripts has also been shown 
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(Dib-Hajj et al, (1996) Proc. Natl. Acad. Sci. USA 93, 14950-14954). Also associated with 
axotomy is a moderate elevation in the levels of al and all mRNAs (Waxman et aL, (1994) J. 
Neurophysiol 72, 466-470). These changes in the sodium channel profile appear to contribute 
to abnormal firing that underlies neuropathic pain that patients suffer following axonal injury. 

Inflammation, which is also associated with pain (termed inflammatory pain), also 
causes alteration in the sodium current profile in nociceptive DRG neurons. Inflammatory 
modulators up-regulate TTX-R current in small C-type nociceptive DRG neurons in culture 
(Gold et al., (1996) Proc. Natl. Acad. Sci. USA 93, 1 108-1 1 12; England et al, (1996) J. 
Physiol. 495, 429-440). The rapid action of these modulators suggests that their action 
include posttranslational modification of existing TTX-R channels. It has now been 
determined that inflammation also increases a TTX-R Na* current and up-regulates SNS 
transcripts in C-type DRG neurons (Tanaka et al, (1998) Neuroreport. 9, 967-972). This data 
suggests that changes in the sodium current profile contribute to inflammation evoked-pain. 

D. Therapies for Chronic Pain : 

A variety of classes of drugs (anticonvulsants such as phenytom and carbamazepine; 
anti-anhythmics such as mexitine; local anesthetics such as lidocaine) act on Na^ channels. 
Since the various Na* channels produce sodium currents with different properties, selective 
blockade or activation (or other modulation) of specific channel subtypes is expected to be of 
significant therapeutic value. Moreover, the selective expression of certain a-subunit isoforms 
(PNl, SNS, NaN) in specific types of neurons provides a means for selectively altering their 
behavior. 

Nociceptive neurons of the DRG are the major source of the PNS TTX-R Na"^ 
current. Thus, the Na* channels producing TTX-R currents provide a relatively specific target 
for the manipulation of pain-producing neurons. The molecular structure of one TTX-R 
channel in these DRG neurons, SNS, has been identified but, prior to our research, it has not 
been determined whether there are other TTX-R channels in these neurons. If such channels 
could be identified, they would be ideal candidates as target molecules that are preferentially 
expressed in nociceptive neurons, and whose modulation would attenuate pain transmission. 
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SUMMARY OF THE INVENTION 

The present invention includes an isolated nucleic acid which encodes a voltage gated 
Na* channel that is preferentially expressed in dorsal root ganglia or trigeminal ganglia (the 
NaN channel). (In our preceding U.S. Provisional Application 60/072,990, this NaN channel 
was referred to by its previous name **NaX"). In a preferred embodiment, the isolated nucleic 
acid comprises the sequence shown in Figure 1 (SEQ ID NO: 1), Figure 7A (SEQ ID NO: 4), 
Figure 8A (SEQ ID NO: 6), Figure 1 lA (SEQ ID NO: 41), allelic variants of said sequences 
or nucleic acids that hybridize to the foregoing sequences under stringent conditions. 

In another embodiment, the invention includes an expression vector comprising an 
isolated nucleic acid which encodes the voltage gated Na* channel that is preferentially 
expressed in dorsal root ganglia or trigeminal ganglia either alone or with appropriate 
regulatory and expression control elements. In a preferred embodiment, the expression vector 
comprises an isolated nucleic acid having the sequence shown in Figure 1 (SEQ ID NO: 1), 
Figure 7A (SEQ ID NO: 4), Figure 8A (SEQ ID NO: 6), Figure 1 1 A (SEQ ID NO: 41), allelic 
variants of said sequences or nucleic acids that hybridize to the foregoing sequences under 
stringent conditions. 

The present invention further includes a host cell transformed with an expression 
vector comprising an isolated nucleic acid which encodes a voltage gated Na^ chaimcl that is 
preferentially expressed in dorsal root gangUa or trigeminal ganglia with ^propriate 
regulatory and expression control elements. In a preferred embodiment, the expression vector 
comprises an isolated nucleic acid having the sequence shown in Figure 1 (SEQ ID NO: 1), 
Figure 7A (SEQ ID NO: 4), Figure 8A (SEQ ID NO: 6), Figure 1 1 A (SEQ ID NO: 41), allelic 
variants of said sequences or nucleic acids that hybridize to the foregoing sequences under 
stringent conditions. 

The present invention also includes an isolated voltage gated Na* channel that is 
preferentially expressed in dorsal root ganglia or trigeminal ganglia. In a preferred 
embodiment, the channel has the amino acid sequence of Figure 2 (SEQ ID NO: 3), Figure 7B 
(SEQ ID NO: 5), Figure SB (SEQ ID NO: 8) or Figure 1 IB (SEQ ID NO: 42), or is encoded 
by a nucleic acid having the sequence shown in Figure 2 (SEQ ID NO: 3), Figure 7B (SEQ ID 
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NO: 5), Figure 8B (SEQ ID NO: 8) or Figure 1 IB (SEQ ID NO: 42), alleUc variants of said 
sequences or nucleic acids that hybridize to the foregoing sequences under stringent 
conditions. Peptide fragments of the channel are also included. 

Another aspect of the invention is a method to identify an agent that modulates the 
activity of the NaN channel, comprising the steps of bringing the agent into contact with a cell 
that expresses the Na* channel on its surface and measuring depolarization, or any resultant 
changes in the sodixun current. The measuring step may be accomplished with voltage clamp 
measurements, by measuring depolarization, the level of intracellular sodium or by measuring 
sodium influx. 

Another aspect of the invention is a method to identify an agent that modulates the 
transcription or translation of mRNA encodmg the NaN chaimel The method comprises the 
steps of bringing the agent into contact with a cell that expresses the Na* channel on its surface 
and measuring the resultant level of expression of the Na* channel. 

The invention also includes a method to treat pain, paraesthesia and hyperexcitability 
phenomena in an animal or human subject by administering an effective amount of an agent 
cq>able of modulating, such as by mhibiting or enhancing, Na* current flow flirough NaN 
channels in DRG or trigeminal neurons. The method may mclude administering an effective 
amount of an agent capable of modulating the transcription or translation of mRNA encoding 
the NaN chatmel. 

Another aspect of the invention is an isolated nucleic acid that is antisense to the 
nucleic acids described above. In a preferred embodiment, the antisense nucleic acids are of 
suflRcient length to modulate the expression of NaN channel mRNA in a cell containing the 
mRNA. 

Another aspect of the invention is a scintigraphic method to image the loci of pain 
generation or provide a measure the level of pain associated with DRG or trigeminal neuron 
mediated hyperexcitabilify in an animal or human subject by administering labeled 
monoclonal antibodies or other labeled ligands specific for the NaN Na^ channel. 
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Another aspect of the invention is a method to identify tissues, cells and cell types 
that express the NaN sodium chaimel. This method comprises the step of detecting NaN on 
the cell surface, or en route to the cell surface, or the presence of NaN encoding mRNA. 

The present invention further includes a method of producing a transformed cell that 
5 expresses an exogenous NaN encoding nucleic acid, comprising the step of transforming the 
cell with an expression vector comprising an isolated nucleic acid having the sequence shown 
in Figures 1, 7 A, 8 A or 11 A, allelic variants of said sequences or nucleic acids that hybridize 
to the foregoing sequences under stringent conditions, together with appropriate regulatory 
and expression control elements. The invention also includes a method of producing 
10 recombinant NaN protein, comprising the step of culturing the transformed host under 

conditions in which the NaN sodium channel or protein is expressed, and recovering the NaN 
protein. 

The invention also includes an isolated antibody specific for the MiA^ channel or 
polypeptide fragment thereof The isolated antibody may be labeled. 

15 Another aspect of the invention includes a therapeutic composition comprising an 

effective amount of an agent enable of decreasing rapidly repriming sodium current flow in 
axotomized, inflamed or otherwise injured DRG neurons or in normal DRG neurons that are 
being driven to fire at high firequency. The invention also includes a method to treat acute 
pain or acute or chronic neuropathic or inflammatory pain and hyperexcitability phenomena in 

20 an animal or a human patient by administering the th^:apeutic con[q)osition. 

The present invention also includes a method to screen candidate compounds for use 
in treating pain and hyperexcitability phenomena by testing their ability to alter the expression 
or activity of an NaN channel mRNA or protein in axotomized, inflamed or otherwise injured 
DRG neurons. 
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BRIEF DESCRIPTION OF THE DRAWING FIGURES 

Figure 1 shows the sequrace of the rat NaN cDNA (SEQ ID NO: 3). 

Figure 2 shows the putative amino acid sequence of the rat NaN cDNA (SEQ ID NO: 
3). Predicted transniembrance segments ofdomains I -IV are underUned. The amino acid 
serine "S" in DI-SS2, implicated in the TTX-R phenotype, is in bold face type. 

Figure 3 presents a schematic diagram of predicted secondary structure of the NaN 
a-subunit. 

Figure 4 shows the results of RT-PCR analysis for a-NaN in extracts of various 
tissues using iVaiV-specific primers. NaN is abundantly expressed in dorsal root and 
trigeminal ganglia. Low levels of NaNQie detected in cerebral hemisphere and retina tissues. 
No detectable ATa^ signal is seen in cerebellum, optic nerve, spinal cord, sciatic nerve, 
superior cervical ganglia, skeletal muscle, cardiac muscle, adrenal gland, uterus, liver and 
kidney. 

Figure 5 shows the tissue distribution of a-NoN by in situ hybridization. Trigeminal 
ganglion neurons show moderate-to-high hybridization signal (A). Dorsal root ganglion 
neurons show moderate-to-high hybridization signal in small neurons (B). Hybridization 
signal is attenuated in large neurons (arrow). (C) Sense probe shows no signal in DRG 
neurons. No hybridization signal is seen in spinal cord, cerebellum and liver (D-F) . All 
tissues are from adult Sprague-Dawley rat (scale bars = SO micrometer). 

Figure 6 shows the predicted lengths of domain I amplification products of rat a- 
subunits and their subunit-specific restriction enzyme profile. 

Figure 7 sets forth the (A) nucleotide (SEQ ID NO: 4) and (B) amino acid (SEQ ID 
NO: 5) sequences of the murine NaN. 

Figure 8 is a partial (A) nucleotide sequence (SEQ ID NO: 6) of the human NaN and 
partial (B) amino acid sequence (SEQ ID NO: 8) of the human NaN protein. 

Figure 9 shows cultures of DRG neurons obtained from L4/5 ganglia of adult rats that 
were reacted with antibody to NaN and then processed for mimunofluorescent localization. 
(A-B) NaN immunostaining is prominent within the cell bodies of DRG neurons. (C) NaN is 
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present in the neuritic outgrowths, as well as the cell bodies, of DRG neurons. Nomarski (D) 
and fluorescent (D') images of a neuron that does not express NaN protein. 

Figure 10 shows the location of Scnl la and related genes on distal mouse 
chromosome 9. (A) Haplotypes from the Jackson BSS backcross. Black boxes represent 
C57BL/6J alleles and white boxes represent SPRET/Ei alleles. The number of animals with 
each haplotype is given below each column. Missing data was inferred from adjacent data 
when typing was ambiguous. (B) Map of distal chromosome 9 based on data in (A). 
Positions of Scn5a and ScnlOa from the MGD consensus map and the locations of the hiunan 
orthologs are indicated. Numbers are cM positions on the consensus map 
(htq)://www.infonnatics.jax.org/bin/ccr/index). 

Figure 1 1 shows the (A) cDNA nucleotide sequence (SEQ ID NO: 41) of the human 
NaN gene spanning the complete open reading frame and (B) sets forth the amino acid 
sequence (SEQ ID NO: 42) of the fall length human NaN protein. 

DETAILED DESCRIPTION 

The present invention relates to a novel gene that Applicants have discovered, called 
NaN NaN encodes a previously unidentified protein, referred to herein as NaN, that belongs 
to the a-subunit voltage-gated sodium channel protein family and that produces a TTX-R 
sodium current. Such channels underlie the generation and propagation of impulses in 
excitable cells like neurons and muscle fibers. NaN is a novel sodium channel, with a 
sequence distinct from other, previously identified, channels. The preferential expression of 
NaN on sensory, but not other neurons, makes it a very use&l target for diagnostic and/or 
therapeutic uses in relation to acute and/or chronic pain pathologies.. 

A. Definitions 

This specification uses several technical terms and phrases which are intended to have 
the following meanings: 

The phrase ^'modulate" or "altei" refers to up- or down-regulating the level or activity 
of a particular receptor, ligand or current flow. For example an agent might modulate Na* 
current flow by inhibiting (decreasing) or enhancing (increasing) Na^ current flow. Similarly, 
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an agent might modulate the level of expression of the NaN sodium channel or the activity of 
the NaN channels that are expressed. 

The phrase ''sodium current or '"Na* current" means the flow of sodium ions across a 
cell membrane, often through channels (specialized protein molecules) that are specifically 
permeable to certain ions, in this case sodium ions. 

The phrase "voltage gated" means that the ion channel opens when the cell membrane 
is in a particular voltage range. Voltage-sensitive sodium channels open when the membrane 
is depolarized. They then permit Na^ ions to flow into the cell, producing further 
depolarization. This permits the cell to generate electrical impulses (also known as "action 
potentials"). 

The phrase ''rapidly repriming" means that the cxurents recover from inactivation more 
rapidly than do such currents in most other voltage gated sodium channel family members. 

The terms "TTX-R" and "TTX-S" means that the flow of current through a cell 
membrane is, respectively, resistant or sensitive to tetrodotoxin (a neurotoxin produced in 
certain species) at a concentration of about 100 nM. 

The phrase "peripheral nervous system (PNS)" means the part of the nervous system 
outside of the brain and spinal cord, i.e., the spinal roots and associated ganglia such as dorsal 
root ganglia (DRG) and trigeminal ganglia, and the peripheral nerves. 

The phrase "inhibits Na^ current flow" means that an agent has decreased such current 
flow relative to a control cell not exposed to that agent. A preferred inhibitor will selectively 
inhibit such current flow, without affecting the current flow of other sodium channels; or it 
will inhibit Na^ currrat in the channel of interest to a much larger extent dian in other 
channels. 

The phrase "enhances Na* current flow" means that an agent has increased such 
current flow relative to a control cell not exposed to that agent. A preferred agent will 
selectively increase such current flow, without affecting the current flow of other sodium 
channels; or it will increase Na^ current in the channel of interest to a much larger extent than 
in other channels. 
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The phrase "specifically hybridizes" ref^s to nucleic acids which hybridize under 
highly stringent or moderately stringent conditions to the nucleic acids encoding the NaN 
sodium channel, such as the DNA sequence of SEQ ID NO: 1, 4, 6 or 41. 

The phrase "isolated nucleic acid" refers to nucleic acids that have been separated from 
or substantially purified relative to contaminant nucleic acids encoding other polypeptides. 
**Nucleic acids" refers to all forms of DNA and RNA, including cDNA molecules and 
antisense RNA molecules. 

The phrase "RT-PCR" refers to the process of reverse transcription of RNA (RT) using 
the enzyme reverse transcriptase, followed by the amplification of certain cDNA templates 
using the polymerase chain reaction (PCR); PGR requires generic or gene-specific primers and 
thermostable DNA polymerase, for example, Taq DNA polymerase. 

The phrase *'preferentially expressed" means that voltage gated Na* channel is 
expressed in the defined tissues in detectably greater quantities than in other tissues. For 
instance, a voltage gated Na^ channel that is preferentially expressed in dorsal root ganglia or 
trigeminal ganglia is found in detectably greater quantities in dorsal root ganglia or trigeminal 
ganglia when compared to other tissues or cell types. The quantity of the voltage gated Na* 
channel may be detected by any available means, including the detection of specific RNA 
levels and detection of the channel protein with specific antibodies. 

B. Characterization of the NaN Sodium Channel 

Hie present invention relates to a previously unidentified, voltage-gated sodium 
channel a-subunit {NaN), predicted to be TTX-R, , voltage-gated, and preferentially expressed 
in sensory neurons innervating the body (dorsal root ganglia or DRG) and the face (trigeminal 
ganglia). The predicted open reading frame (ORF), the part of the sequence codmg for the 
NaN protein molecule, has been determined with the putative amino acid sequence from 
different species (rat, mouse, human) presented in Figures 2 (SEQ ID NO: 3), 7B (SEQ ID 
NO: 5), 8B (SEQ ID NO: 8) or 1 IB (SEQ ID NO: 42). 

All of the relevant landmark sequences of voltage-gated sodium channels are present 
in NaN at the predicted positions, indicating that Mz// belongs, to the sodium channel family. 
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But NaN is distinct from all other previously identified Na channels, sharing a sequence 
identity of less than 53% with each one of them. NaN is distinct from SNS, the only other 
TTX-R Na* channel subunit that has been identified, until our discovery, in PNS. We have 
identified and cloned iVaiV without using any primers or probes that are based upon or specific 
to SNS, Moreover, NaN and SNS share only 47% similarity of their predicted open reading 
fiame (ORF), comparable to the limited sunilarity oiNaN to all subfamily 1 members. 

The low sequence similarity to existing a-subunits clearly identifies MiA^as a novel 
gene, not simply a variant of an existing channel. Sequence variations compared to the other 
voltage-gated channels indicate that NaN may be the prototype of a novel and previously 
unidentified, third class of TTX-R channels that may possess distinct properties compared to 
SNS. NaN and SNS, which are present in nociceptive DRG and trigeminal neurons, may 
respond to pharmacological interventions in dilRFerent ways. The preferential expression of 
NaN in sensory DRG and trigeminal neurons provides a target for selectively modifying the 
behavior of these nerve cells while not affectmg other nerve cells in the brain and spinal cord. 
A fiuiher elucidation of the properties of NaN channels will be important to understand more 
fiilly the effects of drugs designed to modulate the fimction of the "TTX-R" currents which are 
characteristic of DRG nociceptive neurons and which contribute to the transmission of pain 
messages, and to abnormal firing patterns after nerve injury and in other painftil conditions.. 

C. NaN Nucleic Adds 
Nucleic acid molecules of the invention include the nucleotide sequences set fortti in 
Figures 1, 7A, 8A and 11 A as well as nucleotide sequences that encode the amino acid 
sequences of Figures 2, 7B, 8B and IIB. Nucleic acids of the claimed invention also include 
nucleic acids which specifically hybridize to nucleic acids comprismg the nucleotide 
sequences set forfli in Figures 1, 7A, 8A and 1 1 A, or nucleotide sequences which encode the 
amino acid sequences of Figures 2, 7B, 8B and 1 IB. A nucleic acid which specifically 
hybridizes to a nucleic acid comprising that sequence remains stably bound to said nucleic 
acid under highly stringent or moderately stringent conditions. Stringent and moderately 
stringent conditions are those commonly defined and available, such as those defined by 
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Sambrook et al., (1989) Molecular Cloning - A Laboratory Approach, Cold Spring Harbor 
Press or Ausubel et aL, (1995) Current Protocols in Molecular Biology, Greene Publishing. 
The precise level of stringency is not important, rather, conditions should be selected that 
provide a clear, detectable signal when specific hybridization has occurred. 

Hybridization is a function of sequence identity (homology), G+C content of the 
sequence, buffer salt content, sequence length and duplex melt temperature (T[m]) among 
other variables (see Maniatis et aL, (1982) Molecular Cloning, Cold Spring Harbor Press). 
With similar sequence lengths, the buffer salt concentration and temperature provide useful 
variables for assessing sequence identity (homology) by hybridization techniques. For 
example, where there is at least 90 percent homology, hybridization is commonly carried out 
at 68°C in a buffer salt such as 6xSCC diluted fiiom 20xSSC (see Sambrook et aL, (1989) 
Molecular Cloning - A Laboratory Approach, Cold Spring Harbor Press). The buffer salt 
utilized for final Southern blot washes can be used at a low concentration, e.g., OA^SSC and 
at a relatively high temperature, e.g,, 68 °C, and two sequences will form a hybrid duplex 
(hybridize). Use of the above hybridization and washing conditions together are defined as 
conditions of high stringency or highly stringent conditions. Moderately stringent conditions 
can be utilized for hybridization where two sequences share at least about 80 percent 
homology. Here, hybridization is carried out using 6xSSC at a temperature of about 
50-55**C. A final wash salt concentration of about l-S^SSC and at a temperature of about 
60-68'^C are used. These hybridization and washing conditions define moderately stringent 
conditions. 

In particular, specific hybridization occurs under conditions in which a high degree of 
complementarity exists between two nucleic acid molecules. With specific hybridization, 
complementarity will generally be at least about 70%, 75%, 80%, 85%, preferably about 
90-100%, or most preferably about 95-100%. When referring the human NaN sequence of 
SEQ ID NO:41 and 42, preferred homologous sequences will typically encode an NaN protein 
exhibiting at least about 81% amino acid sequence similarity or at least about 75% or 76% 
sequence identity to SEQ ID NO: 42. A more preferred human NaN sequence will contain a 
positively changed residue at amino acid 670, preferably an arginine residue. 
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As used herein, homology or identity is detennined by BLAST (Basic Local 
Alignment Search Tool) analysis using the algorithm employed by the programs blastp, 
blastn, blastx, tbiastn and tblastx (Karlin et al, (1990) Proc. Natl. Acad. Sci, USA 87, 
2264-2268 and Altschul, (1993) J. Mol. Evol. 36, 290-300, both of which are herein 
incorporated by reference) which are tailored for sequence similarity searching. The approach 
used by the BLAST program is to first consider similar segments betwera a query sequence 
and a database sequence, then to evaluate the statistical significance of all matches that are 
identified and finally to summarize only those matches which satisfy a preselected threshold 
of significance. For a discussion of basic issues in sunilarity searching of sequence databases 
(see Altschul et al, Nat. Genet. (1994) 6, 1 19-129) which is herein mcorporated by reference. 
The search parameters for histogram, descriptions, alignments, expect (i.c., the statistical 
significance threshold for reporting matches against database sequences), cutoff, matrix and 
filter are at the default settings. The default scoring matrix used by blastp, blastx, tbiastn, 
and tblastx is the BLOSUM62 matrix (Henikofif al, (1992) Proc. Natl. Acad. Sci. USA 89, 
10915-10919, herein incorporated by reference). For blastn, the scoring matrix is set by the 
ratios of M (z.e., the reward score for a pair of matching residues) to N (i.e., the penalty score 
for mismatching residues), wherein the default values for M and N are 5 and -4, respectively. 

The nucleic acids of the present invention can be used in a variety of ways in 
accordance with the present invention. For example, they can be used as nucleic acid probes 
to screen other cDNA and genomic DNA libraries so as to select by hybridization other DNA 
sequences that encode homologous AfoT/ sequences. Contemplated nucleic acid probes could 
be RNA or DNA labeled with radioactive nucleotides or by non-radioactive methods (for 
example, biotin). Screening may be done at various stringencies (through manipulation of the 
hybridization Tm, usually using a combination of ionic strength, temperature and/or presence 
of formamide) to isolate close or distantly related homologs. The nucleic acids may also be 
used to generate primers to amplify cDNA or genomic DNA using polymerase chain reaction 
(PCR) techniques. The nucleic acid sequences of the present invention can also be used to 
identify adjacent sequences in the genome, for example, flanking sequences and regulatoiy 
elements of NaN. The nucleic acids may also be used to generate antisense primers or 
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constructs that could be used to modulate the level of gene expression of NaN. The amino 
acid sequence may be used to design and produce antibodies specific to NaN that could be 
used to localize NaN to specific cells and to modulate the function of NaN channels expressed 
on the surface of cells. 

Nucleic acids of the invention also include recombinantly prepared altered NaN 
sequences. For instance, fiision proteins may be prepared with the open reading frames herein 
disclosed, or fimctional firagments thereof, and any available fiision protein. Nucleic acid 
molecules may also be prepared that encode chimeric NaN proteins, for instance, chimeric 
proteins comprismg individual domains bom different species. Such chimeric proteins 
include, but are not limited to, human NaN chimeras containing mouse or rat domains, or 
mouse or rat chimeras containing human domains. Preferred chimeras include human NaN 
with a rat or mouse domain surrounding the residue equivalent to amino acid 670 of human 
NaN. 

D. Vectors and Transformed Host Cells 

The present invention also comprises recombinant vectors containing and capable of 
replicating and directing the expression of nucleic acids encoding a NaN sodium channel in a 
compatible host cell. For example, the insertion of a DNA in accordance with the present 
invention into a vector using enzymes such as T4 DNA Ugase, may be performed by any 
conventional means. Such an insertion is easily accomplished when both the DNA and the 
desired vector have been cut with the same restriction enzyme or enzymes, since 
complementary DNA termini are thereby produced. If this cannot be accomplished, it may be 
necessaiy to modify the cut ends that are produced by digesting back single-stranded DNA to 
produce blunt ends, or by achieving the same result by filling in the single-stranded termini 
with an ^propriate DNA polymerase. In this way, blunt-end ligation may be carried out 
Alternatively, any site desired may be produced by Ugating nucleotide sequences (linkers) 
onto the DNA termini. Such linkers may comprise specific oligonucleotide sequences that 
encode restriction site recognition sequences. 
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Any available vectors and the appropriate compatible host cells may be used 
(Sambrook et aL, (1989) Molecular Cloning - A Laboratory Approach, Cold Spring Harbor 
Press; Ausubel et ai, (1995) Current Protocols in Molecular Biology, Greene Publishing). 
Commercially available vectors, for instance, those available from New England Biolabs, 
Promega, Stratagene or other commercial sources are included. 

The transformation of appropriate cell hosts with an rDNA (recombinant DNA) 
molecule of the present invention is accomplished by well known methods that typically 
depend on the type of vector used and host system employed. Frog oocytes can be injected 
with RNA and will express channels, but in general, expression in a mammalian cell line 
(such as HEK293 or CHO cells) is preferred. With regard to transformation of prokaryotic 
host cells, electroporation and salt treatment methods are typically employed (see, for 
example, Cohen et al, (1972) Proc. Natl. Acad. Sci. USA 69. 2110-21 14; and Maniatis et al, 
(1982) Molecular Cloning - A Laboratory Manual, Cold Spring Harbor Press). With regard to 
transformation of vertebrate cells with vectors containing rDNAs, electroporation, cationic 
lipid or salt treatment methods are typically employed (Graham et al, (1973) Virology 52, 
456-467; Wigler e/flt/., Proc. Natl. Acad. Sci. USA (1979) 76. 1373-1376). 

Successfully transformed cells, cells that contain an rDNA molecule of the present 
invention, can be identified by well known techniques. For example, cells resulting from the 
introduction of an rDNA of the present invention can be cloned to produce single colonies. 
Cells from those colonies can be harvested, lysed and their DNA content examined for the 
presence of the rDNA using conventional methods (Southem, (1975) J. Mol. Biol. 98, 503- 
517) or the proteins produced from the cell assayed via an immunological method. If tags 
such as green fluorescent protein are employed in the construction of the recombinant DNA, 
the transfected cells may also be detected in vivo by the fluorescence of such molecules by cell 
sorting. 

For transient expression of recombinant channels, transformed host cells for the 
measurement of Na* current or intracellular Na^ levels are typically prepared by 
co-transfecting constructs into cells such as HEK293 cells with a fluorescent reporter plasmid 
(such as pGreen Lantem-1, Life Technologies) using the calcium-phoq)hate precipitation 
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technique (Ukomadu et aL, (1992) Neuron 8, 663-676). HEK293 cells are typically grown in 
high glucose DMEM (Life Technologies) supplemented with 10% fetal calf serum (Life 
Technologies). After forty-eight hours, cells with green fluorescence are selected for 
recording (Dib-Hajj et al, (1997) FEES Lett. 416, 1 1-14). 

For preparation of cell lines continuously expressing recombinant channels, the NaN 
construct is cloned into other vectors that carry a selectable marker in mammaUan cells. 
Transfections are carried out using the calcium phosphate precipitation technique (Ukomadu 
et al, (1992) Neuron 8, 663-676). Human embryonic kidney (HEK-293), Chinese hamster 
ovaiy (CHO) cells, derivatives of either or other suitable cell lines are grown under standard 
tissue culture conditions in Dulbecco's modified Eagle's medium supplemented with 10% 
fetal bovine serum. The calcium phosphate-DNA mixture is added to the cell culture medium 
and left for 15-20 hours, after which time the cells are washed with fresh medium. After 48 
hours, antibiotic (G418) is added to select for cells which have acquired neomycin resistance. 
After 2-3 weeks in G418, 10-20 isolated cell colonies are harvested using sterile 10 ml pipette 
tips. Colonies are grown for another 4-7 days, spUt and subsequently tested for chaimel 
expression using whole-cell patch-clamp recording techniques and RT-PCR. 

E. Mrthod oirMgayflripg Na"^ Cwrrfflt Flow 

Na^ currents are measured using patch clamp meftiods (Hamill et al, (1981) Pfliigers 
Arch. 391, 85-100), as described by Rizzo et al, (1994) J. Neurophysiol. 72, 2796-2815 and 
Dib-Hajj et al, (1997) FEBS Lett. 416, 11-14, For these recordings data are acquired on a 
Macintosh Quadra 950 or similar computer, using a program such as Pulse (v 7,52, HEKA, 
German). Fire polished electrodes typically (0.8-1 .5 MW) are fabricated from capillary glass 
using a Sutter P-87 puller or a similar instrument. In the most rigorous analyses, cells are 
usually only considered for analysis if initial seal resistance is <5 Gohm, they have high 
leakage currents (holding curr^t <0.1 nA at -80 mV), membrane blebs, and an access 
resistance <5 Mohm. Access resistance is usually monitored throughout the experiment and 
data are not used if resistance changes occur. Voltage enors are minimized using series 
resistance compensation and the capacitance artifact is canceled using computer controlled 
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amplifier circuitry or other similar methods. For comparisons of the voltage dependence of 
activation and inactivation, cells with a maximum voltage error of ±10mV after compensation 
are used. Linear leak subtraction is usually used for voltage clamp recordings. Membrane 
currents are typically filtered at 5 KHz and sampled at 20 KHz. The pipette solution contains 
a standard solution such as: 140 mM CsF, 2 mM MgCl, 1 mM EGTA, and 10 mM Na-HEPES 
(pH 7.3), The standard bathing solution is usually 140 nM NaCl, 3 mM KCl, 2 mM MgCh, 1 
mM CaCb. 10 mM HEPES, and 10 mM glucose (pH 7.3). 

Voltage clamp studies on transformed cells or DRG neurons, using methods such as 
intracellular patch-clamp recordings, can provide a quantitative measiu^ of the sodium current 
density (and thus the number of sodium chatmels in a cell), and chaimel physiological 
properties. These techniques, which measure the currents that flow through ion chaimels such 
as sodium channels, are described in Rizzo et al, (1995) Neurobiol. Dis. 2, 87-96. 
Alternatively, the blockage or enhancement of sodium chaxmel function can be measured 
using optical imaging with sodium-sensitive dyes or with isotopically labeled Na. These 
methods which are described in Rose et al, (1997) J. Neurophysiol. 78, 3249-3258 and by 
(Kimelberg & Waltz, (1988) The Neuronal Microenviromnent (Boulton et al, editors) 
Humana Press), measure the increase in intracellular concentration of sodium ions that occurs 
when sodium chaimels are open. 

F* Measurement oflntracellular Sodium [Na^|, 
The effects of various agents on cells that express Na* can be determined using ratiometric 
imaging of [Na^Jj using SBFI or other similar ion-sensitive dyes. In this method, as described 
by Sontheimer et aL, (1994) J. Neurosci. 14, 2464-2475, cytosolic-free Na* is measured using 
an indicator for Na*, such as SBFI (sodium-binding benzofuran isophthalate (Harootunian et 
al, (1989) J. BioL Chem. 264, 19458-19467)); or a similar dye. Cells are first loaded with the 
membrane-penneable acetoxymethyl ester form of the dye (which is dissolved in dimethyl 
sulfoxide (DMSO) at a stock concentration of 10 mM). Recordings are obtained on tfie stage 
of a microscope using a ratiometric imaging setup (eg., fi^om Georgia Instruments). 
Excitation light is provided at appropriate wavelengths (eg., 340:385 nm). Excitation light is 
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passed to the cells through a dichroic reflector (400 nm) and emitted light above 450 nm is 
collected. Fluorescence signals are amplified, e.g., by an image intensifier (GenllSyS) and 
collected with a CCD camera, or similar device, mterfaced to a frame grabber. To account for 
fluorescence rundown, the fluorescence ratio 340:385 is used to assay cytosolic-free Na^ 

For cahbration of SBFI's fluorescence, cells are perfiised with calibration solutions 
containing known Na^ concentrations (typically 0 and 30 mM, or 0, 30. and 50 mM [Na*]), 
and with ionophones such as gramicidin and monensin (see above) after each experiment. As 
reported by Rose & Ransom, (1996) J. Physiol (Lond) 491, 291-305, the 345/390 nm 
fluorescence ratio of intracellular SBFI changes monotonically with changes m [Na^j. 
Experiments are typically repeated on multiple (typically at least four) different coverslips, 
providing statistically significant measurements of intracellular sodium in control cells, and in 
cells exposed to various concentrations of agents that may block, inhibit or enhance Na\ 

G. Method to Measure Na* Influx via Measuring ^Na or "Rh 
^^a is a gamma emitter and can be used to measure Na^ flux (Kimelberg & Waltz, 
(1988) The Neuronal Microenvironment (Boulton et aL, editors) Humana Press), and ^^Rb"" 
can be used to measure Na'^/K'^-ATPase activity (Sontheimer et al, (1994) J. Neurosci, 14, 
2464-2475). « W ions are taken up by the Na^/K^-ATPase-like ions, but have the 
advantage of a much longer half-life than ^^K"" (Kimelberg & Mayhew (1975) J. Biol. Chem. 
250, 100-104). Thus, measurement of the unidirectional ouabain-sensitive ^^Rb"" uptake 
provides a quantitative method for assaying NaW- ATPase activity which provides another 
indicator of the electrical firing of nerve cells. Following incubation of cells expressing NaN 
with the isotope ^a% the cellular content of the isotope is measured by liquid scintillation 
counting or a similar method, and cell protein is determined using a method such as the 
bicinchoninic acid protein assay (Smith et aL, (1985) Anal. Biochem. 150, 76-85) following 
the modifications described by Goldschmidt & Kimelberg (1989) Anal. Biochem. 177, 41-45 
for cultured cells, ^a and *^Rb* fluxes are determined in the presence and absence of agents 
that may block, inhibit, or enhance NaN. This permits determination of the actions of these 
agents on NaN, 
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H. Method to Identify Agents that Modulate NaN-Mediated Current 

Several approaches can be used to identify agents that are able to modulate (ue,, 
block or augment) the Na^ current through the NaN sodium channel In general, to identify 
such agents, a model cultured cell line that expresses the NaN sodium channel is utilized^ and 
one or more conventional assays are used to measure Na"^ current Such conventional assays 
include^ for example, patch clamp methods, the ratiometric imaging of [Na^], and the use of 
^a and ^%b as described above. 

In one embodiment of the present invention, to evaluate the activity of a candidate 
compound to modulate Na^ current, an agent is brought into contact with a suitable 
transformed host cell that expresses NaN, After mixing or appropriate incubation time, the 
Na^ current is measured to determine if the agent inhibited or enhanced Na*^ current flow. 

Agents that inhibit or enhance Na^ current are thereby identified. A skilled artisan can 
readily employ a variety of art-recognized techniques for determining whether a particular 
agent modulates the Na^ current flow. 

Because Na^ is preferentially expressed in pain-signaling cells, one can also design 
agents that block, inhibit, or enhance Na*^ channel function by measuring the response of 
laboratory animals, treated with these agents, to acute, inflammatory or chronic pain. In one 
embodiment of this aspect of the invention, laboratory animals such as rats are treated with an 
agent for instance, an agent that blocks or inhibits (or is thought to block or inhibit) NaN. The 
response to various painful stimuli are then measured using tests such as the tail-flick test and 
limb withdrawal reflex, and are compared to imtreated controls. These methods are described 
by Dubner, (1994) Textbook of Pain (Wall & Melzack, editors) Churchill Livingstone 
Publishers. In another embodiment of this aspect of the invention, laboratory animals such as 
rats are subjected to localized injection of pam-producing inflammatory agents such as 
formalin (Dubuisson & Dennis (1977) Pain 4, 161-74), Freunds adjuvant (ladarola et a/., 
(1988) Pain 35, 313-326) or carageenan, or are subjected to nerve constriction (Bennett & Xie, 
(1988) Pain 33, 87-107; Kim & Chung (1992) Pain 50, 355-363) or nerve transection (Seltzer 
et aL^ (1990) Pain 43, 205-218) which produce persistent pain. The response to various 
normal and painful stimuli are then measured, for example, by measuring the latency to 
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withdrawal from a wann or hot stimulus (Dubner, (1994) Textbook of Pain (Wall & Melzack, 
editors) Churchill Livingstone Publishers) so as to compare control animals and animals 
treated with agents that are thought to modify NaN. 

The preferred inhibitors and enhancers of NaN preferably will be selective for the NaN 
6 Na* channel. They may be totally specific (like tetrodotoxin, TTX, which inhibits sodium 
channels but does not bind to or directly effect any other channels or receptors), or relatively 
^ecific (such as lidocaine which binds to and blocks several types of ion channels, but has a 
predilection for sodium channels). Total specificity is not required for an inhibitor or 
enhancer to be efficacious. The ratio of its effect on sodium channels vs. other channels and 

10 receptors, may often detemiine its effect and effects on several channels, in addition to the 
targeted one, may be efiScacious (Stys et aL, (1992) J. NeurophysioL 67, 236-240). 
Modulators of NaN may be combined with or coadministered with agents that modulate other 
channels expressed in priniaiy sensory neurons, including but not limited to PNl/hNE and 
SNS/PN3 (Waxman (1999) Pain Supplement 6:8133-140), 

15 It is contemplated that modulating agents of the present invention can be, as examples, 

peptides, small molecules, naturally occurring and other toxins and vitamin derivatives, as 
well as carbohydrates. A skilled artisan can readily recognize that there is no limit as to the 
structural nature of the modulating agents of the present invention. Screening of libraries of 
molecules may reveal agents that modulate NaN or current flow through it. Similarly, 

20 naturally occurring toxins (such as those produced by certain fish, amphibians and 
invertebrates) can be screened. Such agents can be routinely identified by exposing a 
transformed host cell or other cell which expresses a sodium channel to these agents and 
measuring any resultant changes in Na^ current. 



I. Recombinant Protein Expression^ Synthesis and Parification 
25 Recombinant NaN proteins can be expressed, for example, in E. coli strains HBlOl , 

DHSa or the protease deficient strain such as CAG-456 and purified by conventional 
techniques. 
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The pq)tide ag^ts of the invention can be prepared using standard solid phase (or 
solution phase) peptide synthesis methods, as is known in the art. In addition, the DNA 
encoding these peptides may be synthesized using commercially available oUgonucleotide 
synthesis instrumentation and produced recombinantly using standard recombinant production 
systems. The production using solid phase peptide synthesis is necessitated if 
non-gene-encoded amino acids are to be included. 

J. Antibodies and Immunodetection 

Another class of agents of the present invention are antibodies immunoreactive with 
the Na^ channel. These antibodies may block, inhibit, or enhance the Na* cuirent flow 
through the channel. Antibodies can be obtained by immunization of suitable mammalian 
subjects with peptides, containing as antigenic regions, those portions of NaN, particularly 
(but not necessarily) those that are exposed extracellularly on the cell surface. Such 
immunological agents also can be used in competitive binding studies to identify second 
generation inhibitoiy agents. The antibodies may also be usefiil in imaging studies, once 
appropriately labeled by conventional techniques. 

K. Production of Transgenic Animals 

Transgenic animals containing and mutant, knock-out or modified NaN genes are 
also included in fiie invention. Transgenic animals wherein both NaN and the SNS/PN3 gene 
are modified, disrupted or in some form modified are also included in the present invention. 
Transgenic animals are genetically modified animals into which recombinant, exogenous or 
cloned genetic material has been experimentally transferred. Such genetic material is often 
referred to as a "transgene". The nucleic acid sequence of the transgene, in this case a form of 
NaN, may be integrated either at a locus of a genome where that particular nucleic acid 
sequence is not otherwise normally found or at the normal locus for the transgene. The 
transgene may consist of nucleic acid sequences derived fi*om the genome of the same species 
or of a different species than the species of the target animal. 

The term "germ cell line transgenic animal" refers to a transgenic animal in which the 
genetic alteration or genetic information was introduced into a germ line cell, thereby 
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conferring the ability of the transgenic animal to transfer the genetic information to offspring. 
If such offspring in fact possess some or all of that alteration or genetic information, then they 
too are transgenic animals. 

The alteration or genetic information may be foreign to the species of animal to which 
the recipient belongs, foreign only to the particular individual recipient, or may be genetic 
information aheady possessed by the recipient. In the last case, the altered or introduced gene 
may be expressed differently than the native gene. 

Transgenic animals can be produced by a variety of different methods including 
transfection, electroporation, microinjection, gene targeting in embryonic stem cells and 
recombinant viral and retroviral infection (see, eg., U.S. Patent No. 4,736,866; U.S. Patent 
No. 5,602,307; MuUins et aL, (1993) Hypertension 22, 630-633; Brenin et aL, (1997) Surg. 
Oncol. 6, 99-1 10; Tuan (1997) Recombinant Gene Expression Protocols, Humana Press). 

A number of recombinant or transgenic mice have been produced, including those 
which express an activated oncogene sequence (U.S. Patent No. 4,736,866); express simian 
SV 40 T-antigen (U.S. Patent No. 5,728,915); lack the expi^ion of interferon regulatory 
factor 1 (BRF-l) (U.S. Patent No. 5,731,490); exhibit dopaminergic dysfunction (U.S. Patent 
No. 5,723,719); express at least one human gene which participates in blood pressure control 
(U.S. Patent No. 5,731,489); display greater similarity to the conditions existing in naturally 
occurring Alzheimer's disease (U.S. Patent No. 5,720,936); have a reduced capacity to 
mediate cellular adhesion (U.S. Patent No. 5,602,307); possess a bovine growth hormone gene 
(Clutter et al, (1996) Genetics 143, 1753-1760); or, are capable of generating a fully human 
antibody response (McCarthy, (1997) Lancet 349, 405). 

While mice and rats remain the animals of choice for most transgenic experimentation, 
in some instances it is preferable or even necessary to use alternative animal species. 
Transgenic procedures have been successfully utilized in a variety of non-murine animals, 
mcluding sheep, goats, pigs, dogs, cats, monkeys, chimpanzees, hamsters, rabbits, cows and 
guinea pigs (see Kim et al, (1997) Mol. Reprod. Dev. 46, 515-526; Houdebine, (1995) 
Reprod. Nutr. Dev. 35, 609-617; Fetters (1994) Reprod. FertiL Dev. 6, 643-645; Schnieke et 
aU (1997) Science 278, 2130-2133; Amoah, (1997) J. Animal Sci. 75, 578-585). 
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The method of introduction of nucleic acid fragments into recombination competent 
mammalian cells can be by any method which favors co-transformation of multiple nucleic 
acid molecules. Detailed procedures for producing transgenic animals are readily available to 
one skilled in the art, including the disclosures in U.S. Patent No. 5,489,743 and U.S. Patent 
No. 5,602,307. 

The specific examples presented below are illustrative only and are not intended to 
limit the scope of the inventioa 

EXAMPLES 

Example 1: Cloning and Characterization of the Rat NaN Coding Sequence 

a. RNA PmaraUQn 

Dorsal root ganglia (DRG) from the lumber region (L4-L5) were dissected from adult 
Sprague-Dawley rats and total cellular RNA was isolated by the single step guanidinum 
isothiocyanate-acid phenol procedure (Chomczynsld, (1987) Anal. Biochem. 162, 156-159). 
For analytical applications, DRG tissues were dissected from a few animals at a time. The 
quality and relative yield of the RNA was assessed by electrophoresis in a 1% agarose gel. 
Because of the limited starting material (four DRCrs weigh on average 1 0 mg), quantifying the 
RNA yield was not attempted. PolyA+ RNA was purified fi^om about 300 mg of total DRG 
RNA (28 animals) using the PolyATract isolation system according to the manufacturers 
recommendations (Promega). Half of the purified RNA was used for the preparation of 
Marathon cDNA (see below) without fiirther quantification. 

b. Reverse Transcription 

For analytical applications, first strand cDNA was synthesized essentially as 
previously described (Dib-Hajj et al, (1996) FEBS Lett. 384, 78-82). Briefly, total RNA was 
reverse transcribed in a 25 ml final volume using ImM random hexamer (Boehringer 
Mannheim) and 500 units Superscript II reverse transcriptase (Life Technologies) in the 
presence of 100 tmits of RNase Inhibitor (Boehringer Mannheim). The reaction buffer 
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consisted of 50 mM Tris-HCl (pH 8.3), 75 mM KCl, 3 mM MgCh, 10 mM DTT and 125 mM 
dNTP. The reaction was allowed to proceed at 37 °C for 90 minutes, 42 ""C for 30 minutes, 
then terminated by heating to 65 ''C for 10 minutes. 

c. First'Strand cDNA Synthesis 

The Marathon cDNA synthesis protocol was followed according to the 
manufacturer's instruction as summarized below (all buffers and enzymes are purchased from 
the manufacturer (Clontech): 

Combine the following reagents in a sterile 0.5-ml microcentrifuge tube: 1 mg (1-4 
ml) PolyA* RNA sample, one ml cDNA Synthesis Primer (10 mM) and sterile water to a final 
volume of 5 ml Mix contents and spin the tube briefly in a microcentrifuge. Incubate the 
mixture at 70*'C for two minutes, then immediately quench the tube on ice for two minutes 
Touch-spin the tube briefly to collect the condensation. Add the following to each reaction 
tube: 2 ml 5x First-Strand Buffer, 1 ml dNTP Mix (10 mM), 1 ml [a-^^pj^^P (1 mCi/ml), 1 
ml AMV Reverse Transcriptase (20 units/ml) for a 10 ml volume. The radiolabeled dCTP is 
optional (used to detennine yield of cDNA) and is replaced by sterile H2O if not used. Mix 
the contents of the tube by gently pipetting and touch-spin the tube to collect the contents at 
the bottom. Incubate the mixture at 42**C for one hour in an air incubator to reduce 
condensation and enhance the yield of the first strand cDNA. Place the tube on ice to 
terminate first-strand synthesis. 

d. Second-Strand cDNA Synthesis 

Combine the following components in the reaction tube from above: 48.4 ml Sterile 
water, 16 ml 5x Second-Strand Buffer, 1.6 ml dNTP Mix (10 mM), 4 ml 20x Second-Strand 
Enzyme Cocktail for an 80 ml total volume. Mix the contents thoroughly with gentle 
pipetting and spin the tube briefly in a microcentrifuge. Incubate the mixture at 16**C for 1 .5 
hours then add 2 ml (10 units) of T4 DNA Polymerase, mix thorougihly with gentle pipetting 
and incubate the mixture at IS'^C for 45 minutes. Add 4 ml of the EDTA/Glycogen mix to 
terminate second-strand synthesis. Extract the mixture with an equal volume of buffer- 
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saturated (pH 7.5) phenolxhlorofonniisoamyl alcohol (25:24:1). Mix the contents thoroughly 
by vortexing and spin the tube in a microcentrifuge at maximum speed (up to 14,000 rpm or 
ISOOQxg), 4''C for ten minutes to separate layers. Carefully transfer the top aqueous layer to a 
clean 0.5-ml tube. Extract the aqueous layer with 100 ml of chloroform:isoamyl alcohol 
(24:1), vortex, and spin the tube as before to separate the layers. Collect fte top lay^ into a 
clean 0.5-ml microcentrifuge tube. Ethanol precipitate the double-straaded cDNA by adding 
one-half volume of 4 M Ammonium Acetate and 2.5 volumes of room-temperature 95% 
ethanol. Mix thoroughly by vortexing and spin the tube immediately in a microcentrifuge at 
top speedy room temperature for twenty minutes Remove the supernatant carefully and wash 
the pellet with 300 ml of 80% ethanol. Spin the tube as before for 10 minutes and carefully 
remove the supernatant. Air dry the pellet for up to 10 minutes and dissolve the cDNA in 10 
ml of sterile H2O and store at -20**C. Analyze the yield and size of cDNA by running 2 ml of 
the cDNA solution on a 1.2% agarose/EtBr gel with suitable DNA size markers (for example, 
the 1 kilobp ladder, Gibco-BRL). If EtBr staining does not show a signal and [a-*T]dCTP 
was included in the reaction, dry the agarose gel on a vacuum gel diying system and expose an 
x-ray film to the gel overnight at -70**C. 

e. Adaptor Ligation 

Combine these reagents in a 0.5-ml microcentrifuge test tube, at room temperature, 
and in the following order: 5 ml double-stranded cDNA, 2 ml Marathon cDNA Adaptor (10 
mM), 2 ml 5x DNA Ligation Buffer, 1 ml T4 DNA Ligase (1 unit/ml) for a 10 ml final 
volimie. Mix the contents thoroughly with gentle pipetting and spin the tube briefly in a 
microcentrifuge. Incubate at either: 16''C overnight; or room temperature (19-23 °C) for three 
to four hours. Inactivate the ligase enzyme by heating the mixture at 70 °C for five minutes. 
Dilute 1 ml of this reaction mixture with 250 ml of Tricine-feDTA buffer and use for RACE 
protocols. Store the undiluted adaptor-ligated cDNA at -20*'C for future use. 
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f. PCR 

For the initial discovery ofNaN, we used generic primers designed against highly 
conserved sequences in domain 1 (Dl) of a-subunits I, II and HI and later added more primers 
to accommodate the new a-subunits that were discovered. Thus, generic primers were used 
that recognize conserved sequences in all known Na^ channels. The middle of the amplified 
region shows significant sequence and length polymorphism (Figure 6) and (Gu et aL, (1997) 
J. NeurophysioL 77, 236-246; Fjell et al, (1997) MoL Brain Res. 50, 197-204). Due to codon 
degeneracy, 4 forward primers (F1-F4) and 3 reverse primers (R1-R3) were designed to ensure 
efficient priming fiom all templates that might have been present m the cDNA pool (Table 1); 
however, any of these primers may bind to multiple templates d^ending on the stringency of 
the reaction. Forward primer Fl matches subunits al, alll; aNa6; aPNl; ami, arHl and 
aSNS/PN3. Sequences of individual subunits show 1 or 2 mismatches to this primer: T to C at 
position 16 and A to G at position 18 (aNa6); C to R at position 6 (ami); A to G at position 18 
(arHl) and T to C at position 3 (aSNS). Forward primer F2 matches subunit all. Forward 
primer F3 perfectly matches aNa6 and also matches arHl with a single mismatch of C to T at 
position 16. Reverse primer Rl matches subunits al, all, alH, aNa6, aPNl, ami and arHl. 
This primer has mismatches compared to 4 subunits: G to A at position 3, A to G at position 4 
and T to G at position 7 (al); T to C at position 1 and A to G at position 19 (aPNl); G to A at 
position 3 and A to G at position 7 (ami); an extra G after position 3, GC to CT at positions 
14-15, and A to T at position 21 (arHl). Reverse primer R2 matches subunit aSNS/PN3. 
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Table 1 : Generic and MiiV-specific primers used for the identification and cloning of NaN. 
All primers except the marathon primers, were synthesized at the department of Pathology, 
Program for Critical Technologies in Molecular Medicine, Yale University. 



Forward Primers 


Reverse Primers 


1. GACCCRTGGAATTGGTTGGA 
(SEQIDNO:9) 


1. CAAGAAGGCCCAGCTGAAGGTGTC 
(SEQIDNO: 15) 


2. AATCCCTGGAATTGGTTGGA 
(SEQIDNO: 10) 


2. GAGGAATGCCCACGCAAAGGAATC 
(SEQIDNO: 16) 


3. GACCCGTGGAACTGGTTAGA 
(SEQIDNO: 11) 


3. AAGAAGGGACCAGCCAAAGTTGTC 
(SEQIDNO: 17) 


4. GATCTTTGGAACTGGCTTGA 
(SEQIDNO: 12) 


4. ACYTCCATRCANWCCCACAT 
(SEQIDNO: 18) 


5. AACATAGTGCTGGAGTTCAGG 
(SEQIDNO: 13) 


5. AGRAARTCNAGCCARCACCA 
(SEQIDNO: 19) 


6. GTGGCCTTTGGATTCCGGAGG 
(SEQIDNO: 14) 


6. TCTGCTGCCGAGCCAGGTA 
(SEQ ID NO: 20) 




7. CTGAGATAACTGAAATCQCC 
(SEQ E) NO: 21) 


Marathon AP-1 CCATCCTAATACGACTCACTATAGGGC (SEQ ID NO: 22) 


Marathon AP-2 ACTCACTATAGGGCTCGAGCGGC (SEQ ID NO: 23) 



The respective mouse atypical sodium channel mNav2.3 sequence was used to design forward 
primer F4 and reverse primer R3 to amplify the analogous sequence from aNaG, the presumed 
rat homolog of mNav2.3 (Felipe et al, (1994) J. Biol. Chem. 269, 30125-30123 1). The 
amplified sequence was cloned into the Srfl site of the vector pCR-Script (Stmtagene). The 
nucleotide sequence of this fragment shows 88% identity to the respective sequence of 
mNav2.3 (Dib-Hajj & Waxman, unpublished). The restriction enzyme Xba I was found to be 
unique to this subunit. Recently, the sequence of a fiill length cDNA clone of putative sodium 
channel, NaG-like (SCL-1 1 :Y09164), subunit was published (Akopian et al, (1997) FEBS 
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Lett. 400, 183-187). The published sequence is 99% identical to our sequence and confirms 
the size and restriction enzyme polymorphism of the NaG PCR product 

The predicted lengths of amplified products and subunit-specific restriction enzyme 
recognition sites are shown in Figure 6. All subunit sequences are based on Genbank database 
(accession numbers: al: X03638; aU: X03639; alH: Y00766; aNa6: L39018; ahNE-Na: 
X82835; ami M26643; arHl M27902 and aSNS X92184; mNa2.3 L36719). 
Subsequently, amplification of NaN sequences 3* terminal to the aforementioned fiagment 
was achieved using NaN-specific primers and two generic reverse primers, R4 and R5. The 
sequence of the R4 primer was based on the amino acid sequence MWV/DCMEV (SEQ ID 
NO: 38) located just N-terminal to domain n S6 segment (see schematic diagiam of Figure 3 
of voltage^gated sodium channel a-subunits for reference). The sequence of the R5 primer is 
based on flie amino acid sequence AWCWLDFL (SEQ ID NO: 43) which forms the 
N-terminal portion of domain in S3 segment. 

Amplification was typically performed in 60 ml volume using one ml of the first 
strand cDNA, O.SmM of each primer and 1.75 units of Expand Long Template DNA 
polymerase enzyme mixture (Boehringer Mannheim). Compared to conventional and 
ttiermostable DNA polymerases. Expand Long Template enzyme mixture increases the yield 
of the PCR products without an increase in non-specific amplification (Barnes, (1994) Proc. 
Natl. Acad. Sci. USA 91, 2216-2220; Cheng et al., (1994) Proc. Natl Acad, Sci. USA 91, 
5695-5699). The PCR reaction buffer consisted of 50 mM Tris-HCl (pH 9.2), 16 mM 
(NH4)2S04, 2.25 mM MgCh, 2% (v/v) DMSO and 0.1% Tween 20. As described previously 
(Dib-Hajj et aL, (1996) FEBS Lett. 384, 78-82), amplification was carried out m two stages 
using a programmable thermal cycler (PTC-200, MJ Research). First, a denaturation step at 
94°C for four minutes, an annealing step at 60°C for two minutes and an elongation step at 
72°C for 90 seconds. Second, a denaturation step at 94°C for one minute, an annealing step at 
60°C for one minute and an elongation step at 72 ^'C for 90 seconds. The second stage was 
repeated 33 times for a total of 35 cycles, with the elongation step in the last cycle extended to 
ten minutes 
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Primary RACE amplification was performed in 50 ml final volume using 4 ml diluted 
DRG marathon cDNA template, 0.2 mM marathon AP-1 and NaN-specific primers, 3.5 units 
Expand Long Template enzyme mixture. Extension period was adjusted at 1 minute per 800 
base pairs based on the expected product. 5' and 3' RACE amplification was performed using 
primer pairs marathon AP-l/A/iaA^-specific R6 and iVaiV-specific F5/marathon AP-1, 
respectively. The PCR reaction buffer consisted of 50 mM Tris-HCl (pH 9.2), 16 mM 
(NH4)2S04, 3.0 mM MgCh, 2% (v/v) DMSO and 0. 1% Tween 20. Amplification in three 
stages was performed in a programmable thermal cycler (PTC-200, MJ Research). An initial 
denaturation step at 92°C was earned out for two minutes This was followed by 35 cycles 
consisting of denaturation at 92°C for 20 seconds, annealing step at 60°C for one minute, and 
an elongation step at 68 ''C. Finally, an elongation step at 68 °C was carried out for five 
minutes Nested amplification was performed using 2 ml of a 1/500 diluted primary RACE 
product in a final volume of 50 ml under similar conditions to the primary RACE reactions. 
Primer pairs AP-2/NaN-specific R7 and NaN-specific F6/marathon AP-2 were used for nested 
5* and 3' RACE, respectively. Secondary RACE products were band isolated from 1% 
agarose gels and purified using Qiaex gel extraction kit (Qiagen). 

A schematic diagram of the putative structure of NaN is shown in Figure 3. The 
length of the intracellular loops is highly variable both in sequence and length among tiie 
various subunits. The exception is the loop between domains HI and IV. 

Example 2: Determination of the Putative Rat Amino A dd Sequence for the NaN Channel 
iViiA^related clones and secondary RACE fragments were sequenced at the W. M. 
Keck Foundation Biotechnology Resource Lab, DNA sequencing group at Yale University. 
Sequence analysis including determination of the predicted amino acid sequence was 
performed using commercial softwares, Lasergene pNAStar) and GCG. The putative amino 
acid sequence of NaN is shown in Figure 2. Predicted transmembrane segments of domains I 
- IV are underlined. 
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Examples: Detemination of the Murine NaN Sequence 

Total RNA extraction from trigeminal ganglia of mice, purification of polyA+ RNA, 
and Marathon cDNA construction were done as previously described for the rat The initial 
amplification was performed using rat iViaA^ primers. The forward primer corresponds to 
nucleotides 765-787 of the rat sequence (5' CCCTGCTGCGCTCGGTGAAGAAG 3*) (SEQ 
ID NO: 24), and the reverse primer corresponds to nucleotides 1 156-1 137 (negative strand) of 
the rat sequence (5* GACAAAGTAGATCCCAGAGG 3') (SEQ ID NO: 25). The 
amplification produced a fi-agment of the expected size. The sequence of this fi-agment 
demonstrated high similarity to rat NaN, Other fi-agments w^e amplified using different rat 
primers and primers designed based on the new mouse MiiV sequence that was being 
produced. Finally, longer Segments were amplified using mouse Marathon cDNA template 
and mouse A^oiV^specific primers in combination with adaptor primers that were introduced 
during the Marathon cDNA synthesis. These fragments were sequenced using primer walking 
and assembled into Figure 7 A. 

Mouse NaN nucleotide sequence, like rat NoNy lacks the out-of-firame ATG at the -8 
position relative to the translation initiation codon ATG at position 41 (Figure 7A). 
Translation termination codon TGA is at position 5314. A polyadenylation signal (AATAAA) 
is present at position 5789 and a putative 23 nucleotide polyA tail is present beginning at 
position 5800. The sequence encodes an ORF of 1765 ammo acids (Figure 7B), which is 90% 
similar to rat NaN. The gene encoding NaN has been named Scnl la. 

Chromosomal localization of mouse NaN 

A genetic polymorphism between strains C57BL/6J and SPRET/Ei was identified by 
SSCP analysis of a 274 bp fragment from the 3'UTR of Scnl la. Genotypmg of 94 animals 
fix)m the BSS backcross panel (Rowe et al, (1994) Mamm. Genome 5, 253-274) 
demonstrated linkage of Scnl la with markers on distal chromosome 9 (Figure 10). No 
recombinants were observed between Scnl la and the microsatellite marker D9Mitl9, 
Comparison of our data with the MGD consensus map of mouse chromosome 9 revealed close 
linkage of Scnl la with the two other TTX-R voltage-gated sodium channels, ScnSa (George 
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et al, (1995) Cytogenet. Cell. Genet. 68, 67-70) and ScnlOa (Kozak & Sangameswaran, 
(1996) Manun. Genome 7, 787-788; Souslovae/ a/., (1997) Genomics 41, 201-209). 

Example 4: Determinati on of a Partial and Complete Human NaN Coding Sequence 

Human DRG tissue was obtained from a transplant donor. Total RNA extraction and 
cDNA synthesis were performed as described previously. 

Forward primer corresponds to sequence 310-294 (minus strand) of EST AA446878. 
The sequence of the primer is 5' CTCAGTAGTTGGCATGC 3* (SEQ ID NO: 26). Reverse 
primer corresponds to sequence 270-247 (minus strand) of EST AA88521 1. The sequence of 
the primer is 5'GGAAAGAAGCACGACCACACAGTC 3' (SEQ ID NO: 27). Amplification 
was performed as previously described. PGR amplification was successfiil and a 2. 1 kbp 
firagment was obtained. This fragment was gel purified and sent for sequencing by primer 
walking, similar to what is done for mouse NaK The sequence of the ESTs is extended in 
both directions; the additional sequence shows highest similarity to rat and mouse NaN^ 
compared to the rest of the subunits. 

The sequence of a human 2.1 kbp fragment was obtained using the PGR forward and 
reverse primers for sequencing from bofli ends of the fragment. Two additional primers were 
used to cover the rest of the sequence. The sequence was then extended in the 5' direction 
using forwani primer 1 (above) and human NaN reverse primer (5'- 
GTGCCGTAAACATGAGAGTGTCG3') (SEQ ID NO: 44) near the 5' end of the 2.1 kbp 
fragment The partial amino acid sequence is set forth in Figure SB. 

The partial ORF of the human MziV consists 1241 amino acids. The sequence is 64% 
identical to the corresponding sequence of rat NaN (73Vo similar, allowing for conservative 
substitutions) using the advanced BLAST program available at the National Center for 
Biotechnology Information. Using the Clustal method of aligmnent (Lasergene software, 
DNAStar) the human NaN is 68% and 69% similar to mouse and rat NaN, respectively. The 
respective mouse and rat sequences are 88% similar. 

Further sequencing revealed the cDNA sequence spannmg the fiill length open reading 
fi^e for the human NaN gene. This extended sequence is shown in Figure 1 1 A (SEQ ID 
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NO: 41). In addition to the features noted with regard to the partial cDNA sequence (Figure 
8 A), notable features of the extended sequence include a translation start codon (ATG) at 
position 31 and a translation termination codon at position 5400. A recognizable 
polyadenylation signal has not been observed and presumably is located 3' of the disclosed 
5 sequence. The putative amino acid sequence of the hiunan Nan protein is set forth in Figure 
11B(SEQIDN0:42). 

Examples: Isolation of an Alternative Splicing Variant of Rat NaN 

A rat NaN cDNA that encodes a C-terminal truncated version of the full-length rat 
NaN in Figures 1 and 2 was isolated by sequencing the insert of a rat cDNA clone. The 

10 variant NaN cDNA encodes an NaN protein lacking the 387 C-terminal amino acids of the fiill 
length NaN and containing a novel 94 amino acid stretch at the C-terminal end. The new 
sequence arises from the use of a cryptic donor splice site in exon 23 and a novel exon 23 ' , 
which is located in intron 23. Thee novel C terminal amino acids are: AAGQAMRKQG 
DILGPNIHQF SQSSETPFLG CPQQRTCVSF VRPQRVLRVP WFPAWRTVTF 

15 LSRPRSSESS AWLGLVESSG WSGLPGESGP SSLL (SEQ ID NO: 28). The N-terminal 
amino acids of the truncated variant are identical to amino acids 1-1378 of the full length rat 
NaN of Figure 2. The alternative exon and the splicing pattern was confirmed by comparing 
the cDNA sequence and the genomic sequence in the req)ective region. 

20 Example 6: Methods to Isolate Other Afajy Sequences 

a. Isolation of NaN sequences from genomic DNA 

The genomic structure of three voltage-gated Na^ channel a-subunits have already 
been detennined (George et aL, (1993) Genomics 15, 598-606; Souslova et al, (1997) 
Genomics 41, 201-209; McClatchey et ai, (1992) Hum. Mol. Genet. 1, 521-527; Wang et ai, 
25 (1996) Genomics 34, 9-16). These genes bear remarkable similarity in their organization and 
provide a predictable map of most of the exon/intron boundaries. Based on the available rat, 
mouse and human cDNA sequence of NaN, disclosed herein, PCR primers are designed to 
amplify iV'o^ homologous sequences from other species using standard PCR protocols. 
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Aitematively, commercially available genomic DNA libraries are screened with 
^!fltMspecific probes (based on the rat, mouse, or more preferably, the human sequence) using 
standard library screening procedures (Sambrook et aL, (1989) Molecular Clonmg - A 
Laboratory Approach, Cold Spring Harbor Press; Ausubel et aL, (1995) Current Protocols in 
Molecular Biology, Greene Publishing). This strategy yields genomic DNA isolates that can 
then be sequenced and the exon/intron boundaries determined by homology to the rat, mouse 
or human cDNA sequence. 

b. Isolation of full length NaN sequenc e allelic varianis&om autopsy or biopsy 

tissues 

For isolation of human ganglia total RNA, a full length NaNhumm cDNA 
homologue is isolated 6om hxraian dorsal root ganglia or trigeminal ganglia or other cranial 
ganglia from post-mortem human material, foetuses or biopsy or surgical tissues. Total 
ribonucleic acid (RNA) is isolated from these tissues by extraction in guanidinium 
isothiocyanate (Saiki etai, (1985) Science 230, 1350-1354) as described in Example 1. 

For Determination of the full length transcript size of the human homologue of the rat 
NaN sodiimi channel cDNA, the method of determining transcript size is as described in 
Example 9. 

Example?: Production of human DRG cDNA library 

A cDNA library from human DRG or trigeminal ganglia polyA+ RNA was prepared 
in Example 4 using standard molecular biology techniques (Sambrook et aL, (1989) Molecular 
Cloning - A Laboratory Approach, Cold Spring Harbor Press; Ausubel et al., (1995) Current 
Protocols in Molecular Biology, Greene Publishing, 

PolyA+ mRNA is hybridized to an oligo(dT) primer and the RNA is copied by 
reverse transcriptase into single strand cDNA. Then, the RNA in the RNA-DNA hybrid is 
fragmented by RNase H as £ coli DNA polymerase I synthesizes the second-strand fragment 
The ends of the double stranded cDNA are repaired, linkers carrying specific restriction 
enzyme site (for example, Eco RI) are ligated to the ends using E. coli DNA ligase. The pool 
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of the cDNA insert is then ligated into one of a variety of bacteriophage vectors that are 
commercially available like Lambda-Zap (Stratagene). The procedures are summarized in 
more detail as follows: 

a. First strand cDNA Synthesis 

Dissolve 10 mg poly(A) + RNA at a concentration of 1 mg/ral in sterile water. Heat 
the RNA for two to five minutes at eS-TO^C then quench immediately on ice. In a separate 
tube add in the following order (180 ml total) : 20 ml of 5 mM dNTPs (500 fiM final each), 
40 ml 5x RT buffer (1 xfinal), 10 ml 200 mM DTT (10 mM final), 20 ml 0.5 mg/ml oligo 
(dT)12.18 (50 mg/ml final), 60 ml deionized water, 10 ml (10 units) RNasin (50 units/ml 
final). Mix by vortexing, briefly microcentrifiige, and add the mixture to the tube containing 
the RNA. Add 20 ml (200 U) AMV or MMLV reverse transcriptase for a final concentration 
of 1000 units/ml in 200 ml. Mix by pipetting up and down several times and remove 10 ml 
to a separate tube containing 1 ml of a^^P dCTP. Typically, incubate both tubes at room 
temperature for five minutes, then place both tubes at 42°C for one and a half hours. This 
radiolabeled aliquot is removed to determine incorporation and permit an estimation of 
recovery; this reaction is stopped by adding 1 ml of 0.5 M EDTA 8.0) and stored frozen 
at -20°C. The radiolabeled reaction will be used later to estimate the yield and average size of 
the cDNA inserts. The main reaction is stopped by addmg 4 ml of 0.5 M EDTA (pH 8.0) and 
200 ml buffered phenol. The mixture is vortexed well, microcentrifiiged at room temperature 
for one minute to separate phases, and the upper aqueous layer is transferred to a fresh tube. 
Back extract the phenol layer with 1 xTE buffer (10 mM Tris, 1 mM EDTA, pH 7.5) and pool 
the aqueous layers from the two extractions. This back extraction of the phenol layer 
improves the yield. The cDNA is ethanol precipitated using 7.5 M ammonium acetate (final 
concentration 2.0 to 2.5 M) and 95% ethanol. Place in dry ice/ethanol bath fifteen minutes, 
warm to 4°C, and microcentrifuge at ten minutes at full speed at 4'*C to pellet nucleic acids. 
The small, yellow-white pellet is then washed with ice-cold 70% ethanol, and 
microcentrifiiged for three minutes at fiiU speed, 4**C. Again, the supernatant is removed and 
the pellet briefly dried. 
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b. Second strand synthesis, 

Typically, the pellet from the first-strand synthesis is resuspended in 284 ml water 
and these reagents are added in the following order (400 ml total): Four ml of 5 mM dNTPs 
(50 \iM final each), 80 ml 5x second-strand buffer (Ixfinal), 12 ml 5 mM P-NAD (150 
final). 2 ml 10 uCi/ml a-^^P dCTP (50 ^Ci/ml final). Mix by vortexing, briefly 
microcentrifiige, and add: 4 ml (4 imits) RNase H (10 units/ml final), 4 ml (20 units) E, coli 
DNA ligase (SO units/ml final), and 10 ml (100 units) E, coli DNA polymerase I (250 
units/ml final). Mix by pipetting up and down, briefly microcentrifiige, and incubate twelve to 
sixteen hours at 14°C. After second-strand synthesis, remove 4 ml of the reaction to 
detemiine the yield firom the incorporation of the radiolabel into acid-insoluble material. 
Extract the second-strand synthesis reaction with 400 ml buffered phenol and back extract the 
phenol phase with 200 ml TE buffer (pH 7.5) as described above. The double stranded cDNA 
is then ethanol precipitated as described above. 

To complete the second-strand synthesis the double-stranded cDNA ends are 
rendered blunt usmg a mixture of enzymes. Resuspend the pellet in 42 ml water then add 
these reagents in the following order (80 ml total): 5 ml 5 mM dNTPs (310 fiM final each), 
16 ml 5xTA buffer (1 xfinal), 1 ml 5 mM p-NAD (62 yM final). Mix by vortexing, 
microcentrifiige briefly, and add: 4 ml of 2 mg/ml RNase A (100 ng/ml final), 4 ml (4 units) 
RNase H (50 units/ml final), 4 ml (20 units) E. coli DNA ligase (250 units/ml final) and 4 ml 
(8 units) T4 DNA polymerase (100 units/ml final). Mix as above and incubate forty-five 
minutes at Add 120 ml TE buffer (pH 7.5) and 1 ml of 10 mg/ml tRNA. Extract with 
200 ml buffered phenol and back extract the phenol layer with 100 ml TE buffer as described 
above. Pool the two aqueous layers and ethanol precipitate as described above. 

c. Addition of linkers to double stranded cDNA 

Combine fliese reagents in a 0.5 ml microcentrifiige test tube, at room temperature, 
and in the following order: 100 ng double stranded cDNA, 2 ml linkers/adaptors (1 0 mM), 2 
ml 5x DNA Ligation Buffer, 1 ml T4 DNA Ligase (unit/ml) for a 10 ml final volume. Mix 
the contents thoroughly with gentle pipetting and spin tfie tube briefly in a microcentrifiige. 
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Incubate at either: 16°C overnight; or room temperature (19-23 °C) for three to four hours. 
Inactivate the ligase enzyme by heating the mixture at 70''C for five minutes. This cDNA is 
typically digested by Eco RI to prqiare the cohesive ends of the cDNA for ligation into the 
vector and to cleave link^ concatemers. Typically this reaction consists of the 10 ml of the 
cDNA, 2 ml of lOx Eco RI buffer (depending on fhs company of source), 2 ml of Eco RI (10 
units/ml) and sterile water to a final volume of 20 ml. The mixture is incubated at 37'*C for 
two to four hours. 

d. SizeJrwUgmUQn ofcPNA 

Size exclusion columns are typically used to remove linker molecules and short 
cDNA fi:agments (350 bp). For example, a 1-ml Sepharose CL-4B colunm is prepared in a 
plastic column plugged with a small piece of sterilized glass wool (a 5 ml plastic pipet will 
work fine). The column is equilibrated with 0.1 M sodium chloride in 1 xTE (lOmM Tris, 1 
mM EDTA, pH 7.5). The cDNA is (hen loaded onto the column and 200 fil firactions are 
collected. 2 ^1 aliquots of each fii^action are analyzed by gel electrophoresis and 
autoradiography to determine the peak of cDNA elution. Typically, fi:BCtions containing the 
first half of the peak are pooled and piuified by ethanol precipitation and resuspending in 10 
fil distilled water. 

e. Cloning of cDNA into bacteriophage vector 

Bacteriophage vectors designed for the cloning and propagation of cDNA are 
provided ready-digested witti Eco RI and with phosphatased ends fi-om commercial sources 
(e.g., lambda gtlO fix)m Stratagene). The prepared cDNA is ligated into lambda vectors 
following manufacturer's instructions. Ligated vector/cDNA molecules are packaged into 
phage particles usmg packaging extracts available commercially. 
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ExampleS: Screening of Human cDNA Library 

a. Labeling of cDNA fra ^ents (probes) for library screening 
An RNA probe is used that recognizes nucleotide sequences that are specific to NaNy 
such as 1371-1751 of NaN. Other nucleotide sequences can be developed on the basis of the 
NaN sequence (Figures 2, 7 & 8) such as nucleotides 765-1 160 of the human nucleotide 
sequence. A Hind ISM Bam HI fragment of NaN was inserted in pBluescript (SK+) vector 
(Stratagene). The sequence ofthe resulting construct was verified by sequencing. The 
orientation of the insert is such that the 5' and 3' ends of the construct delineated by the Hind 
in and Bam HI restriction enzyme sites, respectively, are proximal to T7 and T3 RNA 
polymerase promoters, respectively. Digoxigenin-labeled Sense (linearized at the Hind JSL site 
and transcribed by T7 RNA polymerase) and antisense (linearized at the Bam HI site and 
transcribed by T3 RNA polymerase) transcripts were prepared in vitro usmg MEGAscript 
transcription kit (Ambion) according to manufacturer specifications. Briefly, 1 ng linearized 
template was transcribed with the respective RNA polym^ase in a 20 jxl final volume 
containing the following reagents: 1 x enzyme mixture containing the respective RNA 
polymerase and RNase mhibitor and reaction buffer (Ambion), 7.5 mM ATP, GTP and CTP 
nucleotides, 5.625 mM UTP and 1.725 mM Dig-1 lUTP (Boehringer Mannheim). In vitro 
transcription was carried out at 37 °C for three hours in a water bath. DNA template was 
removed by adding 1 jil of RNase-free DNase I (2 units/^1) to each reaction and incubatmg 
further at 37'*C for fifteen minutes. The reaction was then stopped by adding 30 jil nuclease- 
free water) and 25 jil of LiCl precipitation solution (7.5 M Lithium Chloride, 50 mM EDTA). 

The mixture was incubated at -20**C for thirty minutes. The RNA transcripts were 
pelleted in a microfiige at 13000xg, 4°C for fifteen minutes. The siq)ematant was removed 
and the pellet washed once with 1 00 (il of 75% ethanol. The mixture was re-centrifuged at 
13000xg, room tenq)erature for five minutes. The pellet was then air-dried in a closed 
chamber and subsequently dissolved in 100 ml of RNase-fi«e water. The transcript yield and 
integrity were determined by comparison to a control DIG-labeled RNA on agarose- 
formaldehyde gel as described in flie DIG/Genius kit according to manufacturer 
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recommendations (Boehringer Mannheim). Alternatively, a skilled artisan can design 
radioactive probes for autoradiographic analysis. 

Other regions of the rat, mouse or human NaN sodium channel cDNA, like 3 ' 
imtranslated sequences, can also be used as probes in a similar fashion for cDNA library 
screening or Northern blot analysis. Specifically, a probe is made using commercially 
available kits, such as the Phaimacia oligo labeling kit, or Genius kit (Boehringer Mannheim). 

b. cDNA library screening 

Recombinant plaques containing full length human homologues of the NaN sodium 
channel are detected using moderate stringency hybridization washes (50-60*'C, S^SSC, thirty 
minutes), using non-radioactive (see above) or radiolabeled DNA or cRNA iVoAT-specific 
probes derived from the 3' untranslated or other regions as described above. Libraries are 
screened usmg standard protocols (Sambrook et al, (1989) Molecular Cloning - A Laboratory 
^proach. Cold Spring Harbor Press; Ausubel et al, (1995) Current Protocols in Molecular 
Biology, Greene Publishmg) involving the production of nitrocellulose or nylon membrane 
filters carrying recombinant phages. The recombinant DNA is then hybridized to NaN- 
specific probes (see above). Moderate stringency washes are carried out. 

Plaques which are positive on duplicate filters {Le,, not artefacts or background) are 
selected for further purification. One or more rounds of screening after dilution to separate the 
phage are typically performed. Resulting plaques are then purified, DNA is extracted and the 
insert sizes of these clones characterized. The clones are cross-hybridized to each other using 
standard techniques (Sambrook et al, (1989) Molecular Cloning - A Laboratory Approach, 
Cold Spring Harbor Press) and distinct positive clones identified 

Typically, overlapping clones that encode the channel are isolated. Standard cloning 
techniques are then used to produce a fiill length cDNA construct that contams any S* 
untranslated sequrace, the start codon ATG, the coding sequence, a stop codon and any 3* 
untranslated sequ^ce, a poly A consensus sequence and possibly a poly A run. If overlapping 
clones do not produce sufficient Augments to assemble a full length cDNA clone, alternative 
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methods like RACE (PCR-based) could be used to generate the missing pieces or a full length 
clone* 

Characterization of a human homologue fuU-length clone 
A iVldiV-specific cDNA sequence from a full-length clone is used as a probe in 
Northern blot analysis to determine the messenger RNA size in human tissue for comparison 
with the rat and mouse messenger RNA size. Confirmation of biological activity of the cloned 
cDNA is carried out using methods similar to those described for the rat NaN, 

Example 9: Polymerase chain reaction fPCR) approaches to clone other full length human 
sodium channels usmg DNA sequences derived from rat mouse or human amino acid 
sequences 

Total RNA and poly A+ RNA is isolated from human dorsal root ganglia or 
trigerninal ganglia or other cranial ganglia from post-mortem human material or foetuses or 
biopsy/surgical tissues as described above. Preparation of cDNA and PCR-based methods are 
then used as described previously in Example 1. 

Using degenerate PCR primers derived from the rat, mouse or human MiAT-specific 
coding sequence (see Figures 2 (SEQ ID NO: 3), 7B (SEQ ID NO: 5), 8B (SEQ ID NO: 8) 
and 1 IB (SEQ ID NO: 41)), the cDNA is amplified using the polymerase chain reaction (Saiki 
et al, (1985) Science 230, 1350-1354). A skilled artisan could utilize the many variables 
which can be manipulated in a PCR reaction to derive the homologous sequences required. 
These include, but are not limited to, varying cycle and step temperatures, cycle and step 
times, nimiber of cycles, thermostable polymerase, and M.^^ concentration. A greater 
specificity can be achieved using nested primers derived from further conserved sequences 
from the A^oA^ sodium channel. 

Amplification is typically performed in 60 [il volume using 1 yA of the first strand 
cDNA, 0.8 mM of each primer and 1,75 units of Expand Long Template DNA polymerase 
enzyme mixture (Boehringer Mannheim). Compared to conventional and thermostable DNA 
polymerases. Expand Long Template enzyme mixture increases the yield of the PCR products 
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without an increase in non-specific amplification (Barnes, (1994) Proc. Natl. Acad. Sci. USA 
91, 2216-2220; Cheng et al, (1994) Proa Natl. Acad. Sci. USA 91, 5695-5699). The PGR 
reaction buffer consists of 50 mM Tris-HCl (pH 9.2), 16 mM (NH4)2S04, 2.25 mM MgCh, 2% 
(v/v) DMSO and 0.1% Tween 20. As described previously (Dib-Hau et aL, (1996) FEBS 
Lett. 384, 78-82), amplification is earned out in two stages using a progranunable thermal 
cycler (PTC-200, MJ Research). Fust, a denaturation step at 94**C for four minutes, an 
annealing step at SO^'C for two minutes and an elongation stq> at 72**C for ninety seconds. 
Second, a denaturation step at 94'*C for one minute, an annealing step at eO^C for one minute 
and an elongation step at 72''C for ninety seconds. The second stage is repeated 33 times for a 
total of 35 cycles, with the elongation step in the last cycle extended to 10 minutes. In 
addition, control reactions are perfomied alongside fhe samples. These should be: (1) all 
components without cDNA, (negative control) and (2) all reaction components with primers 
for constitutively expressed product, e.g, GAPDH, 

The products of the PGR reactions are examined on 1-1.6% (w/v) agarose gels. 
Bands on the gel (visualized by staming with ethidiimi bromide and viewing under UV light) 
representing amplification products of the approximate predicted size are then cut fix>m the gel 
and the DNA purified. 

The resulting DNA may be sequenced directly or is Ugated into suitable vectors such 
as, but not limited to, pGR n (Invitrogen) or pGEMT (Promega). Glones are then sequenced 
to identify those containing sequence with similarity to the rat, mouse or partial human NaN 
sodium channel sequence. 

Example 10: Glone analysis 

Candidate clones fi:om Example 9 are fiirther characterized by conventional techniques. 
The biological activity of expression products is also confirmed using conventional 
techniques. 
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Example 11: Isolation of full length jVaj^ sequences from human fetal tissues 

Commercially available human fetal cDNA libraries and/or cDNA pools are screened 
with iVoiV-specific primers (by PCR) or probes (library screening) using PGR standard PGR 
protocols and standard library screening procedures as described above. 

5 Example 12: Northem Blot of rat DRG or Trigeminal Neurons with Fra^ ents of Rat NaN 
10-30 )xg total DRG and/or RNA from DRG or trigeminal (for positive tissues) and 
cerebral hemisphere, cerebellum and liver (for negative tissues) is electrophoresed in 
denaturing 1% agarose-formaldehyde gel or agarose-glyoxal gel, and then is transferred to.a 
nylon membrane as described in achieved in multiple steps, as detailed in standard molecular 

10 biology manuals (Sambrook et aL, (1989) Molecular Cloning - A Laboratory Approach, Cold 
Spring Harbor Press; Ausubel et aL^ (1995) Current Protocols in Molecular Biology, Greene 
Publishing). Radiolabeled (specific activity of >10* dpm/pg) or Digixoginen-labeled RNA 
probes are typically used for Nordiem analysis. An antisense RNA probe (see Example 20, 
which describes in situ hybridization with a iVoA/^-specific probe) is created by in vitro 

15 synthesis 6om a sense DNA fragment. The membrane carrying the immobilized RNA in 
wetted with 6xSSC, and the membrane is placed RNA-side-up in a hybridization tube. One 
ml formamide prehybridization/hybridization solution per 10 cm^ of membrane is added. 
Prehybridization and hybridization are usually carried out in glass tubes in a commercial 
hybridization oven. The tubes are place in a hybridization oven and incubated, with rotation, 

20 at 60°C for fifteen minutes to one hour. The desired volume of probe is pipeted into the 
hybridization tube, and the incubation is continued with rotation overnight at eO'^C. The 
probe concentration in the hybridization solution should be 10 ng/ml if the specific activity is 
10^ dpm/jig or 2 ng/ml if the specific activity is 10' dprn/fig (use 2-10 ng/ml of Digixogenin 
labeled probe). 

25 The hybridization solution is poured off and an equal volume of 2xSSC/0. 1 % SDS is 

added. Incubation with rotation for 5 minutes at room temperature is carried out. The wash 
solution is changed, and this step is repeated. To reduce background, it may be beneficial to 
double the volume of the wash solutions. The wa^ solution is rq)laced with an equal volume 
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of 0.2xSSC/0.1% SDS and the tube is incubated for five minutes with rotation at room 
temperature. The wash solution is changed and this step is repeated (this is a low-stringency 
wash). For moderate or high stringency conditions, further washes are done with wash 
solutions pre-warmed to moderate (42 °C) or high (68 ^'C) temperatures. The final wash 
solution is removed and the membrane rinsed in 2xSSC at room temperature. 
Autoradiography is then performed for up to one week. Alternatively, signal is detected using 
chemiluminescence technology (Amersham) if non-radioactive probes are used. The 
transcript size is calculated fix>m the signal fi^om the gel in comparison with gel molecular 
weight standard markers. 

Example 13: T^ss^g sp^ific distriMipn of NaN l?y RT-PCR 

iVaiV-specific forward (5' CCCTGCTGCGCTCGGTGAAGAA 3') (SEQ ID NO: 39) 
and reverse primer (5' GACAAAGTAGATCCCAGAGG 3') (SEQ ID NO: 25), were used in 
RT-PCR assays using cDNA template prepared firom multiple rat. These primers amplify 
NaN sequence between nucleotides 765 and 1 156 (392 bp) and are MzA^-specific as judged by 
lack of sunilarity to sequences in the database (using programs like BLASTN from ttie 
National Center for Biotechnology Information). Amplification was typically performed in a 
60 ^1 volume using 1 pi of fiie first strand of cDNA, 0.8 pM of each primer and 1 .75 units of 
Expand Long Template DNA polymerase enzyme mixture (Boehringer Mannheim). 
Compared to conventional and thermostable DNA polymerases. Expand Long Template 
enzyme mixture increases the yield of the PCR products without an increase in non-specific 
amplification (Barnes, (1994) Proc. Natl. Acad. Sci. USA 91, 2216-2220; Cheng et a/., (1994) 
Proc. Natl. Acad. Sci. USA 91 , 5695-5699). The PCR reaction buffer consisted of 50 mM 
Tris-HCl (pH 9.2), 16mM (NH4)2S04, 2.25 mM MgCb, 2% (v/v) DMSO and 0.1% Tween 20. 
As described previously (Dib-Hajj et a/., (1996) FEES Lett. 384, 78-82), amplification was 
carried out in two stages using a programmable thermal cycler (PTC-200, NO Research). 
First, a denaturation step is performed at 94^C for four minutes, followed by an annealing step 
at 60°C for two minutes, and then an elongation step at 72°C for ninety seconds. Second, a 
denaturation step is performed at 94 ""C for one minute, followed by an annealing step at 60°C 
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for one minute, and then an elongation step at 72 for ninety seconds. The second stage was 
repeated 33 times for a total of 25-35 cycles, with the elongation step in the last cycle 
extended to ten minutes. 

GlyceraIdehyde-3-phosphate dehydrogenase (GAPDH) was used as an internal 
control to ensure that a lack ofNoN signals in different tissues was not due to degraded 
templates or presence of PCR inhibitors. Rat GAPDH sequences were co-amplified using 
primers which amplify a 66 bp product that corresponds to nucleotides 328-994 (accession 
number: Ml 7701). The amplified product spans multiple exon/intron splice sites, based on 
the structure of the human gene (Benham et al, (1 987) Nature 328, 275-278). Dnasel 
treatment was routinely performed prior to reverse transcription to prevent ampUfication of 
GAPDH sequences fi'om genomic processed pseudogenes that may have contaminated the 
total RNA preparation (Ercolani et aL, (1988) J. Biol. Chem. 263, 15335-15341). 

NaN is primarily and preferentially expressed in DRG and trigeminal ganglia 
neurons. Figure 4 shows the results of screiming by RT-PCR for the expression of NaN in 
various neuronal and non-neuronal tissues. Lanes 1, 2, 4, 9 and 16 show a smgle 
amplification product co-migratmg with the 400 bp marker, consistent with NoN-speci&c 
product of 392 bp. Lanes 1, 2, 4, 9 and 16 contain products using DRG, cerebral hemisphere, 
retina, and trigeminal ganglia, respectively. Using this assay, NaN was not detected in 
cerebellum, optic nerve, spinal cord, sciatic nerve, superior cervical ganglia, skeletal muscle, 
cardiac muscle, adrenal gland, uterus, liver or kidney (lanes 3, 5-8, and 10-15, respectively). 
The attenuated NaN signal in cerebral hemisphere and retina, and the absence of this signal in 
the remaining tissues is not due to degraded RNA or the presence of PCR inhibitors in the 
cDNA templates as comparable GAPDH amplification products were obtained in a parallel set 
of PCR reaction (data not shown). 

Example 14: Transfomiation of a Host Cell with the NaN Coding Sequence 

Transformed host cells for the measurement of Na^ cim-ent or intracellular Na* levels 
are usually prepared by co-transfecting constructs into cells such as HEK293 cells with a 
fluorescent reporter plasmid (pGreen Lantem-1, Life Technologies, Inc.) using the 
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calcium-phosphate precipitation technique (Ukomadu et al, (1992) Neuron 8, 663-676). 
HEK293 cells are typically grown in high glucose DMEM (Life Technologies) supplemented 
with 10% fetal calf serum (Life Technologies). After 48 hours, cells with grem fluorescence 
are selected for recording (Dib-Hajj et aL, (1997) FEES Lett. 416, 1 1-14). 

For preparation of cell lines continuously expressing recombinant channels, fhe NaN 
construct is cloned into other vectors that cany a selectable marker in mammalian cells. 
Transfections are carried out using the calcium phosphate precipitation technique (Ukomadu 
et al, (1992) Neuron 8, 663-676). Human embryonic kidney (HEK-293), Chinese hamster 
ovary (CHO) cells, or other suitable cell lines are grown under standard tissue culture 
conditions in Dulbeccos's modified Eagle's medium siq>plement6d with 10% fetal bovine 
serum. The calcium phosphate-DNA mixture is added to the cell culture medium and left for 
fifteen to twenty hours, after which time the cells are washed with fresh medium. After forty- 
eight hours, antibiotic (G418) is added to select for cells which have acquired neomycin 
resistance. After two weeks in G418, 10-20 isolated cell colonies are harvested using sterile 
10ml pipette tips. Colonies are grown for another four to seven days, split and subsequently 
tested for channel expression using whole-cell patch-clamp recording techniques and 
RT-PCR. 

Example 15: Production of NaN specific Antibodies 

Antibodies specific to the rat, mouse or human NaN are produced with an 
immunogenic NaN-specific peptide by raising polyclonal antibodies in rabbits. In one 
example, the peptide CGPNPASNKDCFEKEKDSED (rat amino acids 285-304) (SEQ ID 
NO: 40) was selected because it fits the criteria for immunogenecity and surface accessibility. 
This peptide sequence does not match any peptide in the public databases. The underlined 
cysteine (C) residue was changed to Alanine (A) to prevent disulfide bond formation. This 
amino acid change is not expected to significantly affect the specificity of the antibodies. 

Peptide synthesis, rabbit inmiimization, and affinity pmification of the antipeptide 
antibodies were performed using standard procedures. Purified antibodies were then tested on 
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DRG neurons in culture. Immunostaining procedures using these antibodies before and after 
blocking with excess peptide were performed according to standard procedures. 

DRG neurons after sixteen to twenty-four hours in culture were processed for 
immunocytochemical detection of NaN protein as follows. Coverslips were washed with 
complete saline solution (137 mM NaCl, 53 mM KCI, 1 HIM M902 25 mM sorbitol, 10 mM 
HEPES, 3 mM CaCh (pH 7.2)), fixed with 4% paraformaldehyde in 0. 14 M phosphate buffer 
for ten minutes at 4^C, washed with three five minutes with phosphate-buffered saline (PBS), 
and blocked with PBS containing 20% normal goat serum, 1% bovine serum albumin aiid 0. 1 
% Triton X- 100 for fifteen minutes. The coversUps were incubated in anti-NaN antibody 
(1:100 dilution) at 4°C overnight Following overnight incubation, coverslips were washed 
extensively in PBS and then incubated wifli goat anti-rabbit IgG-conjugated to Cy3 (1:3000; 
Amersham) for two hours at room temperature. The coverslips were rinsed with PBS and 
mounted onto glass slides with Aqua-poly-mount. The neurons were examined with a Leitz 
Aristoplan light microscope equipped with epifluorescence and images were captured with a 
Dage DC330T color camera and Scion CG-7 color PCI frame grabber (see Figure 7). 

Example 16: NaN expression is altered in a neuropathic pain model 

The CCI model of neuropathic pain was used to study the plasticity of sodium channel 
expression in DRG neurons. Twenty two adult, femal Sprague-Dawley rats, weighing 240- 
260 g were anesthetized with pentobarbital sodium (50 mg/kg ip) and the right sciatic nerve 
exposed at the mid-thigh. Four chromic gut (4-0) ligatures were tied loosely around the nerve 
as described by Bennett & Xie, (1988) Pain 33, 87-107. The incision site was closed in layers 
and a bacteriostatic agent admmistered intramuscularly. 

Previous studies have shown that transection of the sciatic nerve induces dramatic 
changes in sodiimi currents of axotomized DRG neurons, which is paralleled by significant 
changes to transcripts of various sodium channels expressed in these neurons. Sodium 
currents that are TTX-R and the transcripts of two TTX-R sodium channels (SNS/PN3 and 
NaN) are significantly attenuated while a rapidly repriming silait TTX-S current emerges and 
the transcript of a-III sodium channel, which produces a TTX-S current, is up-regulated. We 
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have discovered that CCI-induced changes in DRG neurons, fourteen days post-surgery, 
mirror those of axotomy. Transcripts of NaN and SNS, the two sensory neuron-specific TTX- 
R channels, are significantly down-regulated as is the TTX-R sodium cxurent, while 
transcripts of the TTX-S a-III sodium channel are up-regulated, in small diameter DRG 
neurons. These changes may be partly responsible for making DRG neurons hyperexcitable, 
that contributes to the hyperalgesia that results fix>m this injury. 

Example 17: Mm^ for flg^tS wlUfib mpdut^tg ft? ^ctjyiity Qf ft? NaN Phmne} usjpg patph 
clamp methods 

Cells lines expressing the cloned Na^ channel are used to assay for agents which 
modulate the activity of the NaN channel, e,g.y agmts which block or inhibit the channel or 
enhance channel opening. Since the channel activation is voltage dq>endent, depolarizing 
conditions may be used for observation of baseline activity that is modified by the agent to be 
tested. Depolarization may be achieved by any means available, for example, by raising the 
extracellular potassium ion concentration to about 20 to 40 nM, or by repeaXed electrical 
pulses. 

The agent to be tested is incubated with HEK 293 or other transformed cells that 
express the Na* channel (Dib-Hajj et a/., (1997) FEBS Lett. 416, 1 1-14). After incubation for 
a sufficient period of time, the agent induced changes in Na^ channel activity can be measured 
by patch-clamp methods (Hamill et ai, (1981) Pflugers Arch. 391, 85-100). Data for these 
measurements are acquired on a Macintosh Quadra 950, or similar computer, using a program 
such as Pulse (v 7.52, HEKA, German). Fire-polished electrodes (0.8-1.5 MW) are fabricated 
fi^om capillary glass using a Sutter P-87 puller or a similar instrument. Cells are usually only 
considered for analysis if initial seal resistance is <5 Gohm, they have high leakage currents 
(holding current <0.1 nA at -80 mV), membrane blebs, and an access resistance <5 Mohm. 
Access resistance is monitored and data is not used if resistance changes occur. Voltage errors 
are minimized using series resistance compensation and the capacitance artifact will be 
canceled as necessary using computer-controlled ampUfier circuitry or other similar methods. 
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For comparisons of the voltage dependence of activation and inactivation, cells with a 
maximum voltage error of <10 mV after compensation are usually used. Linear leak 
subtraction is used for voltage clamp recordings. Membrane currents are typically filtered at 5 
KHz and sampled at 20 KHz. The pipette solution contains a standard solution such as: 140 
mM CsF, 2 mM MgCh, 1 mM EGTA, and 10 mM Na-HEPES (pH 7.3). The standard bathing 
solution is a standard solution such as 140 mM NaCl, 3 mM KCl, 2 mM MgCh^ 1 mM CaCh, 
10 mM HEPES, and 10 mM glucose (pH 7.3). 

Tetrodotoxin (TTX)-resistant and TTX-sensitive Na* currents are measured by 
exposure to q}propiiate concentrations of TTX and/or by pre-pulse protocols which 
distinguish between TTX-sensitive and TTX-resistant currents on the basis of their distinct 
steady-state inactivation properties (Cummins & Waxman (1997) J. Neurophysiol. 17, 
3503-3514; Sontheimer & Waxman, (1992) J. Neurophysiol. 68, 1001-1011). 

Data are collected using standard pulse protocols and are anal3^ed to measure sodium 
current properties that include voltage-dependence, steady-state characteristics, kinetics, and 
re-priming. Measurements of current amplitude and cell capacitance provides an estimate of 
Na^ current density, thereby permitting comparisons of channel density under different 
conditions (Cummins & Waxman (1997) J. Neurophysiol. 17, 3503-3514,30). Cells are 
studied in the current clamp mode to study patterns of spontaneous and evoked action 
potential generation, threshold for firing, firequency response characteristics, and response to 
de- and hyperpolarization, and other aspects of electrogenesis (Sontheimer & Waxman, (1992) 
J. Neurophysiol. 68, 1001-101 1). These measurements are carried out both in control cells 
expressing NaN and in cells expressing NaNihaX also have been exposed to the agent to be 
tested. 

Example 18: Assays for agents which modulate the activity of the NaN charmel bv the 
measurement of hutracgllular Sodium [N^^ ] 

The agent to be tested is incubated with cells exhibiting NaN chaimel activity. After 
incubation for a sufficient period of time, the agent induced changes in Na^ channel are 
measured by ratiometric imaging of [Na^ ]{ using SBFI. In this method, cytosolic-firee Na^ is 
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measured using an indicator for Na^ such as SBFI (sodium-binding benzofuran isophthalate 
(Harootunian et al, (1989) J. Biol. Chem. 264, 19458-19467)); or a similar dye. Cells are first 
loaded with the membrane-permeable acetoxymethyl ester form of SBFI (SBFI/AM) or a 
similar dye (usually dissolved in dimethyl sulfoxide (DMSO) at a stock concentration of 10 
mM). Recordings are obtained on the stage of a microscope using a commercially available 
ratiometric imaging setup {e.g,, bom Georgia Instruments). Excitation light is provided at 
appropriate wavelengths (eg., 340:385 nm). Excitation light is passed to the cells through a 
dichroic reflector (400 nm) and emitted light above 450 nm was collected. Fluorescence 
signals are amplified, e.g.^ by an image intensifier (GenllSyS) and collected with a CCD 
camera, or similar device, interfaced to a fi^me grabber. To accoimt for fluorescence 
rundown, the fluorescence ratio 340:385 is used to assay cytosolic-free Na\ 

For calibration of SBPFs fluorescence, cells are perfused with calibration solutions 
containing known Na* concentrations (typically 0 and 30 mM, or 0, 30, and 50 mM [Na^], and 
gramicidin and monensin. As reported by Rose and Ransom (Rose & Ransom, (1 996) J. 
Physiol. (Lond) 491, 291-305), the 345/390 nm fluorescence ratio of intracellular SBFI 
changes monotonically with changes in [Na^ ]j. Experiments are repeated on multiple 
(typically at least four) different coverslips, providing statistically significant measurements of 
intracellular sodium in control cells, and in cells exposed to various concentrations of agents 
that may block, inhibit or enhance the activity of the channel. Use of this method is illustrated 
in Sontheimer et al, (1994) J. Neurosci. 14, 2464-2475. 

Example 19: Assays for agents which modulate the activity of the NaN channel bv 
scintigraphic imaging 

Cells lines expressing the cloned Na"*" channel are used to assay fSr agents which 
modulate the activity of the NaN channel, eg., agents which block the channel or enhance 
channel opening. For example, the agent to be tested is incubated with HEK 293 or other 
transformed cells that express the Na^ channel (Dib-Hajj et a/., (1997) FEBS Lett. 416, 
1 1-14). After incubation for a sufficient period of time, the agent induced changes in Na^ 
channel activity are detected by the measurement of Na^ influx by isotopic methods, ^a is a 
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gamma emitter and can be used to measure Na^ flux (Kimelberg & Waltz, (1988) The 
Neuronal Microenvironment (Boulton et al., editors) Humana Press) and *^Rb*can be used to 
measure Na^/K*ATPase activity which provides a measure of Na channel activity (Sontheimer 
et aL, (1994) J. Neurosci, 14, 2464-2475) ^^Rb^ ions are taken up by the Na*/K+ATPase like 
K+ ions, but have the advantage of a much longer half-life than ^^K"^ (Kimelberg & Mayhew 
(1975) J. Biol. Chem. 250, 100-104). Thus, measuremrat of the unidirectional 
ouabain-sensitive ^*Rb* uptake provides a quantitative method for assasdng NaVK*-ATPase 
activity which follows action potentials. 

Following incubation of cell expressing NdN to the isotope, ttie cellular content of ttie 
isotope is measured by liquid scintillation counting or a similar method, and cell protein is 
determined using a method such as the bicinchoninic acid protein assay (Smith et aL^ (1985) 
Anal. Biochem. 150, 76-85) following the modifications (Goldschmidt & Kimelberg (1989) 
Anal. Biochem. 177, 41-45) for cultured cells, ^a and '^Rb^ fluxes are determined in the 
presence and absence of agents that may block, inhibit, or enhance Na^ This permits 
determination of the actions of these agents on NdN. 

Example 20: In situ hvbridization 

a. Probes 

Probes are prepared as described above in Example 5. 

b. ORG Neuron Culture 

Cultures of DRG neurons from adult rats were established as described previously 
(Flizzo et al, (1994) J. Neunophysiol. 72, 2796-2815). Briefly, lumbar ganglia (L4, L5) from 
adult Sprague Dawley female rats were freed from their connective sheaths and incubated 
sequentially in enzyme solutions containing coUagenase and then papain. The tissue was 
triturated in culture medium containing 1:1 Dulbecco's modified Eagle's medium (DMEM) 
and Hank's F12 medium and 10% fetal calf serum, 1.5 mg/ml trypsin inhibitor, 1.5 mg/ml 
bovine scrum albumin, 100 units/ml penicillin and 0.1 mg/ml streptomycin and plated at a 
density of 500-1000 cells/mm^ on polyomithine/laminin coated coverslips. The cells were 
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maintained at 37 °C in a humidified 95% air/5% CO2 incubator overnight and then processed 
for in situ hybridization cytochemistry as described previously (Black et al,, (1994) Brain Res. 
Mol. Brain Res. 23, 235-245; Zur et al,, (1995)Brain Res. Mol. Brain Res. 30, 97-105). 
Trigeminal ganglia can be cultured by a skilled artisan using sunilar methods. 

c. Tissue PrmrQtiQn 

Adult female Sprague Dawley rats were deeply anesthetized, e.g., with chloral hydrate 
and perfused through tfie heart, first with a phosphate-buffered saline (PBS) solution and then 
with a 4% paraformaldehyde in 0.14 M Sorensen's phosphate bufifer (pH 7.4) at 4'*C. 
Following perfiision fixation, dorsal root ganglia at levels L4 and L5 and trigeminal ganglia 
were collected and placed in firesh fixative at 4*'C. After two to four houra, the tissue was 
transferred to a solution containing 4% paraformaldehyde and 30% sucrose in 0. 14 M 
phosphate buffer and stored overnight at 4**C. Fifteen tim sections were cut and placed on 
poly-Uysine-coated slides. The sUdes were processed for in situ hybridization cytochemistry 
as previously described (Waxman et al, (1994) J. Neurophysiol. 72, 466-470; Black et at., 
(1994) Brain Res. Mol. Brain Res. 23, 235-245). Following in situ hybridization 
cytochemistry, the slides were dehydrated, cleared and mounted with Permount. The results 
are shown in Figure 5. 

Sections of DRG hybridized with NaN sense riboprobe showed no specific labeling 
(panel C, Figure 5). In DRG (panel A, Figure 5) and trigeminal (panel B) sections hybridized 
with a iVizT/antisense riboprobe, with the NaN signal present in most small (<30 mm diam.) 
neurons; in contrast, most large (>30 mm diam.) neurons did not exhibit A/iiAT hybridization 
signal. Sections of spinal cord, cerebellum and liver hybridized with an antisense NctN 
riboprobe showed no specific signal (panels D, B and F respectively). 

Example 21: Microsatellite Sequences 

The following are the murine intronic microsatellite sequences. These microsatellites 
may be polymorphic m the human SCNl la gene and could be used as markers to screen for 
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mutant alleles that are associated with a disease. Such screening methods, for instance, 
h}^ridization or amplification assays, are readily available. 

Ihtron 4; microsatellite is dTdG (SEQ ID NO: 29) 

AGTTTAATGTTGAGTGAATTGTGGTGGTGATTTCCCACITGAGGCCTTTGTGTTAA 
5 AGCCCAATGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT 
GTGTGGTTGGGGGGTGGTGGCAGAGTCTGGTATTGGTAAGGTGAGAGCAATCCCA 
GAACGTCCACCTGCTCTTCCATTTTATTAATCAGGCAGGCCTCT 

Intron 5; microsatellite is dCdTdG (dNdG2) x (X5-30) (SEQ ID NO: 30) 
GTAAGCCACTGGCTCTTAACTAAAATGCTCGTTGGCATTAGAACATTTCTGAGCTG 
10 GGGTGGTGGTGGTGGTGGTGGTGGTGGTGGTGGTGGTGGTGGTGGTGGTGGTGGT 
GATGGTGGTGGTGGAGGTGGNGGTGGAGGTGGTGGCTGTGGTGGTGGNGGTGGT 
GGTGGTGGTGGANGTGGANGTGGTGGCGTGGTGGTGGNGGTGGTGGTGGAGGTG 
GTGGCTGTGGTGGTNGTGGTGGC 

hitron 6; microsatellite is dCdA (SEQ ID NO: 31) 
15 TGTGCATGCTTGATTCCCAGCTCCTATGGTCTGATTACTCGGTCCTTAGGAGCAAG 
GCCAGACTGTCCACCCTGACACACACACACACACACACACACACACACACACAC 
ACACACACACACAGTGTAGAGAATTACCTCATTCTTGGAGTTTCTCTGGAAAAGG 
AATGTCTCAAAGCCAAGTTCACAGAGCAACAGCTG 

Intron 8; 5' microsatellite is dTdC followed by a stretch of dT (SEQ ID NO: 32) 
20 TGTTAGAAACTCTAAGACAATGAAGCACCATGCTGGAAATAAGAGCACAAACTCT 
TTCTTCATGCATTACCCACTGCTTGTGCTTTCACCTTAGTGCTCGTGCrCTCTC^ 
CTCTCrcrCTCTCTCTCTCrCrCTCTCTCTCrCT 
TTTTTTTTTT 
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Intcon 8; 3' microsatellite is dCdA (SEQ ID NO: 33) 

CACACACACACACACACACACACACACACACACACACACAGAGAAACACTGTCG 

CAGTCATACATATAAAGATAAATACATCTTAAAAAAAGAACCATGTGATTG 

AGTTATAAAATATTCCAACTTAT 

Intron lOB; microsatellite is dCdA followed, three nucleotides downstream by dCdAs (SEQ 
ID NO: 34) 

AGGTCATTTCCTCTGCAGTGTGCTTGGCAGGAAAAACTTCCTGGCTATTCAAGTCA 
GTGCCCTGCTTGATCATCCATGTATCACACACACACAAAACAAACAAACAAACAA 
ACAAAACCCTGGGGAAGAAGGAAGAGGTTAAGCACATAGGCAGAGAGCAGCCA 
GGCTGACTCAGAGCAAACACCTGATCATTCTTCCAT 

Intron 12; microsatellite is dPydG (dT/dCdG) (SEQ ID NO: 35) 

GTGCTGGGATCAAAGGCGTGCGCCGCCACCACGCCCGGCCCCTTTTTATGTTTCA 
AATTTACTTTTATCATGTGCACGTGTGTGGGTGCGTGCATGTGTGTGCGTGCGTGT 
GCGTGTGNGTGTGNGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 

Intron 14; microsatellite is dCdA (SEQ ID NO: 36) 

CACACACACACACACACACACACACACACACACACACACACACACACACACACA 
CAOTGCATCnTrGAGTTAATTGGATAGGCTGAGTCTTACACCGGAATCATACTGT 
TGC 

Intron 15 A; microsatellite is dCdA (SEQ ID NO: 37) 

CCAATGAGAGACTCTTGTCTCAAAAAAGCCATGGTGTCCAGATCCTGAGGAATAA 
CACCTAAGAATGTGCTCTGGCCTGAAAACACACACACACACACACACACACACAC 
ACACACACACAGTTTTATTTATTTATTTAAAAAAATATGTCTCTAGGCATTGCTGA 
AATGTCTCCTACAGGATTAAGTCAACCAGAGCCA 
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It should be understood that the foregoing discussion and examples merely present a 
detailed description of certain preferred embodiments. It will be apparent to those of ordinary 
skill in the art that various modification and equivalents can be made without departing from 
the spirit and scope of the invention. The documents cited and referred to in this patent 
5 specification are hereby incorporated by reference in their entirety. 
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CLAIMS 

1. An isolated nucleic acid molecule selected from the group consisting of a nucleic 
acid molecule comprising SEQ ID NO: 41, a nucleic acid molecule which encodes the amino 
acid sequence of SEQ ID NO: 42, a nucleic acid molecule which encodes an allelic variant of 

5 SEQ ID NO:42, a nucleic acid molecule which encodes a human protein exhibiting at least 
about 76% amino acid sequence identity to SEQ ID NO:42 and a nucleic acid molecule that 
hybridizes to one of the foregoing sequences under stringent conditions. 

2. The isolated nucleic acid of claim I, wherein ttie nucleic acid encodes a voltage 
gated Na* channel that is preferentially expressed m dorsal root ganglia or trigeminal ganglia. 

3. The isolated nucleic acid of claim 2, wherein the nucleic acid encodes the human 
NaN sodium channel 

4. An expression vector comprising the isolated nucleic acid of any one of claims 1 to 
3, alone or togeflier with appropriate regulatory and expression control elements. 

5. A host cell transformed with the expression vector of claim 4, 

6. A Na* channel encoded by an isolated nucleic acid molecule of any one of claims 1 

to 3. 

7. The Na* channel of claim 6, comprising the amino acid sequence of SEQ ID 

NO:42. 

8. An isolated protein consisting of the amino acid sequence of SEQ ID NO: 42 or a 
peptide fragment thereof. 
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9. A protein comprised within a membrane fragment isolated from the host cell of 
claims. 

10. A method to identify an agent that modulates the activity of the Na"^ channel of 
claim 6, comprising the steps of bringing the agent into contact with a cell that expresses the 
Na^ channel on its surface and measuring any resultant changes in the sodium current, 
resultant change in membrane potential or change in intracellular Na^ 

11. The method of claim 10, wherein the measuring step is accomplished by voltage 
clamp measurements or measurement of membrane potential. 

12. The method of claim 10, wherein the measuring step is accomplished by 
measuring the level of intracellular sodium. 

13. The method of claim 10, wherein the measuring step is accomplished by 
measuring sodium influx. 

14. The method of claim 13, wherein the sodium influx is measured using ^^a or 

«^Rb. 

15. A method to identify an agent that modulates the transcription or translation of 
mRNA encoding the Na^ channel of claim 6, comprising the steps of bringing the agent into 
contact with a cell that expresses the Na^ channel and measuring the resultant level of 
expression of the Na^ channel 

16. A method to treat pain, paraesthesia and/or hyperexcitabiUty phenomena in an 
animal or human subject by administering an effective amount of an agent that alters Na^ 
current flow througih NaN channels in DRG or trigeminal neurons. 
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1 7. A method to treat pain, paraesthesia and/or hyperexcitability phenomena in an 
animal or human subject by administering an effective amount of an agent that modulates the 
transcription or translation of mRNA encoding the Na^ channel of claim 6. 

18. An isolated nucleic acid that is antisense to the nucleic acid of claim 1 and of 
sufficient length to modulate the expression of NaN channel in a cell containing the mRNA. 

19. A scintigraphic method to image the loci of pain generation or provide a measure 
of the level of pain associated with DRG or trigeminal neuron mediated hyperexcitability in an 
animal or human subject by administering labeled monoclonal antibodies or other labeled 
ligands specific for the human NaN Na^ channel. 

20. A method to identify tissues, cells or cell types that express the human NaN 
sodium channel, comprising the step of detecting human NaN on the cell surface or 
intracellularly. 

21. A method to identify tissues, cells or cell types that express human NaN 
comprising the step of detecting the presence therein of human NaN encoding mRNA. 

22. A method of producing a transformed cell that expresses an exogenous NaN 
encoding nucleic acid, comprising the step of transforming the cell with an expression vector 
comprising the isolated nucleic acid of any of claims 1 to 3, together with appropriate 
regulatory or expression control elements. 

23. An isolated antibody specific for flie human NaN channel or polypeptide firagment 
thereof. 

24* The isolated antibody of claim 23, wherein the antibody is labeled. . 
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25. A method of producing recombinant NaN protein, comprising the step of culturing 
the transformed host of claim 5 under conditions in which the NaN sodium chaimel or protein 
is expressed 

26. A therapeutic composition comprising an effective amount of an agent capable of 
altering, such as by increasing or decreasing, the rapidly repriming cunent flow in 
axotomized, inflamed or otherwise injured DRG neurons. 

27. A method to treat acute pain or acute or chronic neuropathic or inflammatoty pain 
and hyperexcitability phenomena in an animal or a human patient by administering the 
ther^eutic composition of claim 26. 

28. A method to screen candidate compounds for use in treating pain and 
hyperexcitability phenomena by testing their ability to upregulate or downregulate the NaN 
channel mRNA in axotomized, inflamed or otherwise injured DRG neurons. 

29. A chimeric NaN channel. 

30. A chimeric channel of claim 29, wherein at least one human domain has been 
substituted with the corresponding domain from the NaN channel of another species. 

3 1 . A chimeric channel of claim 30, wherein the species is rat or mouse. 

32. A nucleic acid molecule encoding a chimeric NaN channel of any one of claims 29 
through 31. 
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33. An NaN channel protein comprising a positively charged amino acid at a position 
corresponding to residue 670 of SEQ ID NO:42. 

34. An NaN channel protein of claun 33, wherein the positively charged amino acid is 
arginine. 

35. An isolated nucleic acid molecule encoding a channel protein of either of claims 
33 or 34. 

36. A therapeutic composition of claim 26, further comprising at least one second 
agent that modulates a channel in primary sensory neurons. 

37. A ther^eutic composition of claim 36, wherein the composition comprises agents 
which modulate NaN and at least one channel selected from the group consisting of PNl/hNE 
andSNS/PN3. 
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SEQUENCE LISTING 



<110> Yale University 

Dib-Hajj, Sulayman 
Waxtnan, Stephen G. 

<120> Modulation of Sodiiim Channels in Dorsal Root Ganglia 

<130> 44574-5004-02-WO 

<140> 
<141> 

<150> US 09/354,147 
<151> 1999-07-16 

<160> 44 

<170> Patentin Ver. 2.1 

<210> 1 
<211> 5875 
<212> DNA 

<21.3> Rattus norvegicus 

<220> 

<221> CDS 

<222> (41) . . (5335) 

<223> cDNA sequence for rat NaN 

<220> 

<221> unsure 

<222> (1996) . . (4042) 
<223> n = a or c or g or t 

<400> 1 

acggtgccct gatcctctgt accaggaaga cagggtgaag atg gag gag agg tac 55 

Met Glu Glu Arg Tyr 
1 5 

tac ccg gtg ate ttc ccg gac gag egg aat ttc cgc eec ttc act tec 103 
Tyr Pro Val lie Phe Pro Asp Glu Arg Asn Phe Arg Pro Phe Thr Ser 
10 15 20 

gac tct ctg get gee ata gag aag egg att get ate caa aag gag agg 151 
Asp Ser Leu Ala Ala lie Glu Lys Arg He Ala He Gin Lys Glu Arg 
25 30 35 

aag aag tec aaa gac aag gcg gca get gag cec cag cct egg ect cag 199 
Lys Lys Ser Lys Asp Lys Ala Ala Ala Glu Pro Gin Pro Arg Pro Gin 
40 45 50 

ctt gac eta aag gee tee agg aag tta cet aag ett tat ggt gac att 247 
Leu Asp lieu Lys Ala Ser Arg Lys Leu Pro Lys Leu Tyr Gly Asp He 
55 60 €5 



1 
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ccc cct gag ctt gta gcg aag cct ctg gaa gac ctg gac cca ttc tac 295 
Pro Pro Glu Leu Val Ala Lys Pro Leu Glu Asp Leu Asp Pro Phe Tyr 
70 75 80 85 

aaa gac cat aag aca ttc atg gtg ttg aac aag aag aga aca att tat 343 
Lys Asp His Lys Thr Phe Met Val Leu Asn Lys Lys Arg Thr He Tyr 
90 95 100 

cgc ttc age gcc aag egg gcc ttg ttc att ctg ggg cct ttt aat ccc 391 
Arg Phe Ser Ala Lys Arg Ala Leu Phe He Leu Gly Pro Phe Asn Pro 
105 110 115 

etc aga age tta atg att cgt ate tct gtc cat tea gtc ttt age atg 439 
Leu Arg Ser Leu Met He Arg He Ser Val His Ser Val Phe Ser Met 
120 125 130 

ttc ate ate tgc aeg gtg ate ate aac tgt atg ttc atg gcg aat tct 487 
Phe He He Cys Thr Val He He Asn Cys Met Phe Met Ala Asn Ser 
135 140 145 

atg gag aga agt ttc gac aac gac att ccc gaa tac gtc ttc att ggg 535 
Met Glu Arg Ser Phe Asp Asn Asp He Pro Glu Tyr Val Phe He Gly 
150 155 160 165 

att tat att tta gaa get gtg att aaa ata ttg gca aga ggc ttc att 583 
He Tyr He Leu Glu Ala Val He Lys He Leu Ala Arg Gly Phe He 
170 175 180 

gtg gat gag ttt tec ttc etc cga gat ccg tgg aac tgg ctg gac ttc 631 
Val Asp Glu Phe Ser Phe Leu Arg Asp Pro Trp Asn Trp Leu Asp Phe 
185 190 195 

att gtc att gga aca gcg ate gca act tgt ttt ccg ggc age caa gtc 679 
He Val He Gly Thr Ala He Ala Thr Cys Phe Pro Gly Ser Gin Val 
200 205 210 

aat ctt tea get ctt cgt ace ttc cga gtg ttc aga get ctg aag gcg 727 
Asn Leu Ser Ala Leu Arg Thr Phe Arg Val Phe Arg Ala Leu Lys Ala 
215 220 225 

att tea gtt ate tea ggt ctg aag gtc ate gta ggt gcc ctg ctg cgc 775 
He Ser Val He Ser Gly Leu Lys Val He Val Gly Ala Leu Leu Arg 
230 235 240 245 

teg gtg aag aag ctg gta gac gtg atg gtc etc act etc ttc tgc etc 823 
Ser Val Lys Lys Leu Val Asp Val Met Val Leu Thr Leu Phe Cys Leu 
250 255 260 

age ate ttt gcc ctg gtc ggt cag eag ctg ttc atg gga att ctg aac 871 
Ser lie Phe Ala Leu Val Gly Gin Gin Leu Phe Met Gly He Leu Asn 
265 270 275 

cag aag tgt att aag cac aac tgt ggc ccc aac cct gca tec aac aag 919 
Gin Lys Cys He Lys His Asn Cys Gly Pro Asn Pro Ala Ser Asn Lys 
280 285 290 



2 
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gat tgt ttt gaa aag gaa aaa gat age gaa gac ttc ata atg tgt ggt 967 
Asp Cys Phe Glu Lys Glu Lys Asp Ser Glu Asp Phe lie Met Cys Gly 
295 300 305 

acc tgg etc ggc age aga ccc tgt ccc aat ggt tct acg tgc gat aaa 1015 
Thr Trp Leu Gly Ser Arg Pro Cys Pro Asn Gly Ser Thr Cys Asp Lys 
310 315 320 325 

acc aca ttg aac cca gac aat aat tat aca aag ttt gac aac ttt ggc 1063 
Thr Thr Leu Asn Pro Asp Asn Asn Tyr Thr Lys Phe Asp Asn Phe Gly 
330 335 340 

tgg tec ttt etc gee atg ttc egg gtt atg act caa gac tec tgg gag llll 
Trp Ser Phe Leu Ala Met Phe Arg Val Met Thr Gin Asp Ser Trp Glu 
345 350 355 

agg ctt tac cga cag ate ctg egg acc tct ggg ate tac ttt gtc ttc 1159 
Arg Leu Tyr Arg Gin lie Leu Arg Thr Ser Gly He Tyr Phe Val Phe 
360 365 370 

ttc ttc gtg gtg gtc ate ttc ctg ggc tec ttc tac ctg ctt aac eta 1207 
Phe Phe Val Val Val He Phe Leu Gly Ser Phe Tyr Leu Leu Asn Leu 
375 380 385 

acc ctg get gtt gtc acc atg get tat gaa gaa cag aac aga aat gta 1255 
Thr Leu Ala Val Val Thr Met Ala Tyr Glu Glu Gin Asn Arg Asn Val 
390 395 400 405 

get get gag aca gag gcc aag gag aaa atg ttt cag gaa gee cag cag 1303 
Ala Ala Glu Thr Glu Ala Lys Glu Lys Met Phe Gin Glu Ala Gin Gin 
410 ' 415 420 

ctg tta agg gag gag aag gag get ctg gtt gcc atg gga att gac aga 1351 
Leu Leu Arg Glu Glu Lys Glu Ala Leu Val Ala Met Gly He Asp Arg 
425 430 435 

agt tec ctt aat tec ctt caa get tea tec ttt tec ecg aag aag agg 1399 
Ser Ser Leu Asn Ser Leu Gin Ala Ser Ser Phe Ser Pro Lys Lys Arg 
440 445 450 

aag ttt ttc ggt agt aag aca aga aag tec ttc ttt atg aga ggg tec 1447 
Lys Phe Phe Gly Ser Lys Thr Arg Lys Ser Phe Phe Met Arg Gly Ser 
455 460 465 

aag acg gcc caa gcc tea gcg tct gat tea gag gac gat gee tct aaa 1495 
Lys Thr Ala Gin Ala Ser Ala Ser Asp Ser Glu Asp Asp Ala Ser Lys 
470 475 480 485 

aat cca cag etc ctt gag cag ace aaa cga etg tee cag aac ttg eca 1543 
Asn Pro Gin Leu Leu Glu Gin Thr Lys Arg Leu Ser Gin Asn Leu Pro 
490 495 500 

gtg gat etc ttt gat gag cac gtg gac ccc etc cac agg cag aga gcg 1591 
Val Asp Leu Phe Asp Glu His Val Asp Pro Leu His Arg Gin Arg Ala 
505 510 515 



3 
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ctg age get gtc agt ate tta ace ate ace atg cag gaa caa gaa aaa 1639 
Leu Ser Ala Val Ser lie Leu Thr lie Thr Met Gin Glu Gin Glu Lys 
520 525 530 

ttc cag gag cct tgt ttc cca tgt ggg aaa aat ttg gcc tct aag tac 1687 
Phe Gin Glu Pro Cys Phe Pro Cys Gly Lys Asn Leu Ala Ser Lys Tyr 
535 540 545 

ctg gtg tgg gac tgt age cct caa tgg ctg tgc ata aag aag gtc ctg 1735 
Leu Val Trp Asp Cys Ser Pro Gin Trp Leu Cys He Lys Lys Val Leu 
550 555 560. 565 

egg acc ate atg aeg gat ecc ttt act gag ctg gcc ate ace ate tgc 1783 
Arg Thr He Met Thr Asp Pro Phe Thr Glu Leu Ala He Thr He Cys 
570 575 580 

ate ate ate aat acc gtt ttc tta gcc gtg gag cac cac aac atg gat 1831 
He He He Asn Thr Val Phe Leu Ala Val Glu His His Asn Met Asp 
565 590 595 

gac aac tta aag acc ata ctg aaa ata gga aac tgg gtt ttc acg gga 1879 
Asp Asn Leu Lys Thr He Leu Lys He Gly Asn Trp Val Phe Thr Gly 
600 605 610 

att ttc ata gcg gaa atg tgt etc aag ate ate gcg etc gac cct tac 1927 
He Phe He Ala Glu Met Cys Leu Lys He He Ala Leu Asp Pro Tyr 
615 620 625 

cac tac ttc egg cac ggc tgg aat gtt ttt gac age ate gtg gcc etc 1975 
His Tyr Phe Arg His Gly Trp Asn Val Phe Asp Ser He Val Ala Leu 
630 635 640 645 

etg agt etc get gat gtg etn tac aac aca ctg tct gat aac aat agg 2023 
Leu Ser Leu Ala Asp Val Xaa Tyr Asn Thr Leu Ser Asp Asn Asn Arg 
650 655 660 

tct ttc ttg get tec etc aga gtg ctg agg gtc ttc aag tta gee aaa 2071 
Ser Phe Leu Ala Ser Leu Arg Val Leu Arg Val Phe Lys Leu Ala Lys 
665 670 675 

tec tgg cce acg tta aac act etc att aag ate ate ggc cac tec gtg 2119 
Ser Trp Pro Thr Leu Asn Thr Leu He Lys He He Gly His Ser Val 
680 685 690 

ggc gcg ctt gga aac ctg act gtg gtc ctg act ate gtg gtc ttc ate 2167 
Gly Ala Leu Gly Asn Leu Thr Val Val Leu Thr He Val Val Phe He 
695 700 705 

ttt tct gtg gtg ggc atg egg etc ttc ggc ace aag ttt aac aag acc 2215 
Phe Ser Val Val Gly Met Arg Leu Phe Gly Thr Lys Phe Asn Lys Thr 
710 715 720 725 

gcc tac gcc acc cag gag egg cce agg egg cgc tgg cac atg gat aat 2263 
Ala Tyr Ala Thr Gin Glu Arg Pro Arg Arg Arg Trp His Met Asp Asn 
730 735 740 



4 
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ttc tac cac tec ttc ctg gtg gtg ttc cgc ate cte tgt ggg gaa tgg 2311 
Phe Tyr His Ser Phe Leu Val Val Phe Arg lie Leu Cys Gly Glu Trp 
745 750 755 

ate gag aac atg tgg ggc tgc atg cag gat atg gac ggc tec eeg ttg 2359 
tie Glu Asn Met Trp Gly Cys Met Gin Asp Met Asp Gly Ser Pro Leu 
760 765 770 

tgc ate att gtc ttt gtc ctg ata atg gtg ate ggg aag ett gtg gtg 2407 
Cys lie lie Val Phe Val Leu lie Met Val He Gly Lys Leu Val Val 
775 780 785 

ctt aac etc ttc att gee ttg ctg etc aat tec ttc age aat gag gag 2455 
Leu Asn Leu Phe He Ala Leu Leu Leu Asn Ser Phe Ser Asn Glu Glu 
790 795 800 805 

aag gat ggg age ctg gaa gga gag ace agg aaa ace aaa gtg cag eta 2503 
Lys Asp Gly Ser Leu Glu Gly Glu Thr Arg Lys Thr Lys Val Gin Leu 
810 815 820 

gee ctg gat egg ttc cgc egg gee ttc tec ttc atg ctg cac get ctt 2551 
Ala Leu Asp Arg Phe Arg Arg Ala Phe Ser Phe Met Leu His Ala Leu 
825 830 835 

cag agt ttt tgt tgc aag aaa tgc agg agg aaa aac teg cca aag cca 2599 
Gin Ser Phe Cys Cys Lys Lys Cys Arg Arg Lys Asn Ser Pro Lys Pro 
840 845 850 

aaa gag aca aca gaa age ttt get ggt gag aat aaa gac tea ate etc 2647 
Lys Glu Thr Thr Glu Ser Phe Ala Gly Glu Asn Lys Asp Ser He Leu 
855 860 865 

eeg gat geg agg ecc tgg aag gag tat gat aca gac atg get ttg tac 2695 
Pro Asp Ala Arg Pro Trp Lys Glu Tyr Asp Thr Asp Met Ala Leu Tyr 
870 875 880 885 

act gga cag gee ggg get ccg ctg gcc cca etc gea gag gta gag gac 2743 
Thr Gly Gin Ala Gly Ala Pro Leu Ala Pro Leu Ala Glu Val Glu Asp 
890 895 900 

gat gtg gaa tat tgt ggt gaa ggc ggt gcc eta ecc acc tea caa cat 2791 
Asp Val Glu Tyr Cys Gly Glu Gly Gly Ala Leu Pro Thr Ser Gin His 
905 910 915 

agt get gga gtt cag gee ggt gac etc ect cca gag acc aag cag etc 2839 
Ser Ala Gly Val Gin Ala Gly Asp Leu Pro Pro Glu Thr Lys Gin Leu 
920 925 930 

act age ccg gat gac caa ggg gtt gaa atg gaa gta ttt tet gaa gaa 2887 
Thr Ser Pro Asp Asp Gin Gly Val Glu Met Glu Val Phe Ser Glu Glu 
935 940 945 

gat ctg cat tta age ata cag agt ect cga aag aag tet gac gea gtg 2935 
Asp Leu His lieu Ser He Gin Ser Pro Arg Lys Lys Ser Asp Ala Val 
950 955 960 965 



5 
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age atg etc teg gaa tgc age aca att gac ctg aat gat ate ttt aga 
Ser Met Leu Ser Glu Cys Ser Thr lie Asp Leu Asn Asp lie Phe Arg 
970 975 980 



2983 



aat tta cag aaa aca gtt tec ccc aaa aag cag cca gat aga tgc ttt 
Asn Leu 61n Lys Thr Val Ser Pro Lys Lys Gin Pro Asp Arg Cys Phe 
985 990 995 



3031 



ccc aag ggc ctt agt tgt cac ttt eta tgc cac aaa aca gac aag aga 
Pro Lys Gly Leu Ser Cys His Phe Leu Cys His Lys Thr Asp Lys Arg 
1000 1005 1010 



3079 



aag tec ccc tgg gtc ctg tgg tgg aac att egg aaa ace tgc tac caa 
Lys Ser Pro Trp Val Leu Trp Trp Asn lie Arg Lys Thr Cys Tyr Gin 
1015 1020 1025 



3127 



ate gtg aag cac age tgg ttt gag agt tte ata ate ttt gtt att ctg 
lie Val Lys His Ser Trp Phe Glu Ser Phe He He Phe Val He Leu 
1030 1035 1040 1045 



3175 



ctg age agt gga gcg ctg ata ttt gaa gat gtc aat etc ccc age egg 
Leu Ser Ser Gly Ala Leu He Phe Glu Asp Val Asn Leu Pro Ser Arg 
1050 1055 1060 



3223 



ccc caa gtt gag aaa tta eta agg tgt acc gat aat att tte aca ttt 
Pro Gin Val Glu Lys Leu Leu Arg Cys Thr Asp Asn He Phe Thr Phe 
1065 1070 1075 



3271 



att tte etc ctg gaa atg ate ctg aag tgg gtg gee ttt gga tte egg 
He Phe .Leu Leu Glu Net He Leu Lys Trp Val Ala Phe Gly Phe Arg 
1080 1085 1090 



3319 



agg tat tte ace agt gee tgg tgc tgg ctt gat tte etc att gtg gtg 
Arg Tyr Phe Thr Ser Ala Trp Cys Trp Leu Asp Phe Leu He Val Val 
1095 1100 1105 



3367 



gtg tct gtg etc agt etc atg aat eta cca age ttg aag tec tte egg 
Val Ser Val Leu Ser Leu Met Asn Leu Pro Ser Leu Lys Ser Phe Arg 
1110 1115 1120 1125 



3415 



act ctg egg gcc ctg aga cct ctg egg gcg ctg tec cag ttt gaa gga 
Thr Leu Arg Ala Leu Arg Pro Leu Arg Ala Leu Ser Gin Phe Glu Gly 
1130 1135 1140 



3463 



atg aag gtt gtc gtc tac gee ctg ate age gcc ata cct gcc att etc 
Met Lys Val Val Val Tyr Ala Leu He Ser Ala He Pro Ala He Leu 
1145 1150 1155 



3511 



aat gtc ttg ctg gtc tgc etc att tte tgg etc gta ttt tgt ate ttg 
Asn Val Leu Leu Val Cys Leu He Phe Trp Leu Val Phe Cys He Leu 
1160 1165 1170 



3559 



gga gta aat tta ttt tet ggg aag ttt gga agg tgc att aac ggg aca 
Gly Val Asn Leu Phe Ser Gly Lys Phe Gly Arg Cys He Asn Gly Thr 
1175 1180 1185 



3607 
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gac ata aat atg tat ttg gat ttt acc gaa gtt ccg aac cga age caa 3655 
Asp lie Aszi Met Tyr Leu Asp Phe Thr Glu Val Pro Asn Arg Ser Gin 
1190 1195 1200 1205 

tgt aac att agt aat tac teg tgg aag gte ecg eag gtc aac ttt gac 3703 
Cys Asn lie Ser Asn Tyr Ser Trp Lys Val Pro Gin Val Asn Phe Asp 
1210 1215 1220 

aac gtg ggg aat gee tat etc gee ctg etg caa gtg gca acc tat aag 3751 
Asn Val Gly Asn Ala Tyr Leu Ala Leu Leu Gin Val Ala Thr Tyr Lys 
1225 1230 1235 

ggc tgg ctg gaa ate atg aat get get gtc gat tec aga gag aaa gac 3799 
Gly Trp Leu Glu lie Met Asn Ala Ala Val Asp Ser Arg Glu Lys Asp 
1240 1245 1250 

gag cag ccg gac ttt gag gcg aac etc tac gcg tat etc tac ttt gtg 3847 
Glu Gin Pro Asp Phe Glu Ala Asn Leu Tyr Ala Tyr Leu Tyr Phe Val 
1255 1260 1265 

gtt ttt ate ate ttc ggc tec ttc ttt acc ctg aac etc ttt ate ggt 3895 
Val Phe He He Phe Gly Ser Phe Phe Thr Leu Asn Leu Phe He Gly 
1270 1275 1280 1285 

gtt att att gac aac ttc aat cag cag cag aaa aag tta ggt ggc caa 3943 
Val He He Asp Asn Phe Asn Gin Gin Gin Lys Lys Leu Gly Gly Gin 
1290 1295 1300 

gac att ttt atg aca gaa gaa cag aag aaa tat tac aat gca atg aaa 3991 
Asp He Phe' Met Thr Glu Glu Gin Lys Lys Tyr Tyr Asn Ala Met Lys 
1305 1310 1315 

aag tta gga acc aag aaa cct caa aag cec ate cea agg cec etg aac 4039 
Lys Leu Gly Thr Lys Lys Pro Gin Lys Pro He Pro Arg Pro Leu Asn 
1320 1325 1330 

aan tgt caa gee ttt gtg ttc gac ctg gtc aca age cat gtc ttt gac 4087 
Xaa Cys Gin Ala Phe Val Phe Asp Leu Val Thr Ser His Val Phe Asp 
1335 1340 1345 

gtc ate att ctg ggt ett att gtc tta aat atg att ate atg atg get 4135 
Val He He Leu Gly Leu He Val Leu Asn Met He He Met Met Ala 
1350 1355 1360 1365 

gaa tct gcc gac cag cec aaa gat gtg aag aaa acc ttt gat ate etc 4183 
Glu Ser Ala Asp Gin Pro Lys Asp Val Lys Lys Thr Phe Asp He Leu 
1370 1375 1380 

aac ata gcc ttc gtg gtc ate ttt acc ata gag tgt etc ate aaa gte 4231 
Asn He Ala Phe Val Val He Phe Thr He Glu Cya Leu He Lys Val 
1385 1390 1395 

ttt get ttg agg caa eac tac ttc ace aat ggc tgg aac tta ttt gat 4279 
Phe Ala Leu Arg Gin His Tyr Phe Thr Asn Gly Trp Asn Leu Phe Asp 
1400 1405 1410 
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tgt gtg gtc gtg gtt ctt tct ate att agt acc ctg gtt tec cgc ttg 4327 
Cys Val val Val Val Leu Ser lie lie Ser Thr Leu Val Ser Arg Leu 
14X5 1420 1425 

gag gac agt gac att tct ttc ccg ccc acg etc ttc aga gtc gtc cgc 4375 
Glu Asp Ser Asp He Ser Phe Pro Pro Thr Leu Phe Arg Val Val Arg 
1430 1435 1440 1445 

ttg get egg att ggt cga ate etc agg ctg gtc egg get gee egg gga 4423 
Leu Ala Arg He Gly Arg He Leu Arg Leu Val Arg Ala Ala Arg Gly 
1450 1455 1460 

ate agg ace etc etc ttt get ttg atg atg tet etc cce tct etc ttc 4471 
He Arg Thr Leu Leu Phe Ala Leu Met Met Ser Leu Pro Ser Leu Phe 
1465 1470 1475 

aac ate ggt ctg ctg etc ttc ctg gtg atg ttc att tae gee ate ttt 4519 
Asn He Gly Leu Leu Leu Phe Leu Val Met Phe He Tyr Ala He Phe 
1480 1485 1490 

ggg atg age tgg ttt tec aaa gtg aag aag ggc tee ggg ate gac gac 4567 
Gly Met Ser Trp Phe Ser Lys Val Lys Lys Gly Ser Gly He Asp Asp 
1495 1500 1505 

ate ttc aac ttc gag acc ttt acg ggc age atg ctg tgc etc ttc cag 4615 
He Phe Asn Phe Glu Thr Phe Thr Gly Ser Met Leu Cys Leu Phe Gin 
1510 1515 1520 1525 

ata ace act teg get ggc tgg gat ace etc etc aac ccc atg ctg gag 4663 
He Thr Thr Ser Ala Gly Trp Asp Thr Leu Leu Asn Pro Met Leu Glu 
1530 1535 1540 

gca aaa gaa cac tgc aac tec tec tec caa gac age tgt cag cag ccg 4711 
Ala Lys Glu His Cys Asn Ser Ser Ser Gin Asp Ser Cys Gin Gin Pro 
1545 1550 1555 

cag ata gee gtc gtc tae ttc gtc agt tae ate ate ate tec ttc etc 4759 
Gin He Ala Val Val Tyr Phe val Ser Tyr He He He Ser Phe Leu 
1560 1565 1570 

ate gtg gtc aac atg tac ate get gtg ate etc gag aac ttc aac aca 4807 
He Val Val Asn Met Tyr He Ala Val He Leu Glu Asn Phe Asn Thr 
1575 1580 1585 

gee acg gag gag ago gag gac cct ctg gga gag gac gac ttt gaa ate 4855 
Ala Thr Glu Glu Ser Glu Asp Pro Leu Gly Glu Asp Asp Phe Glu He 
1590 1595 1600 1605 

ttc tat gag gtc tgg gag aag ttt gac ccc gag gcg teg cag ttc ate 4903 
Phe Tyr Glu Val Trp Glu Lys Phe Asp Pro Glu Ala Ser Gin Phe He 
1610 1615 1620 

cag tat teg gee etc tct gac ttt gcg gac gee ctg ccg gag ccg ttg 4951 
Gin Tyr Ser Ala Leu Ser Asp Phe Ala Asp Ala Leu Pro Glu Pro Leu 
1625 1630 1635 
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cgt gtg gcc aag ccg aat aag ttt cag ttt eta gtg atg gac ttg ccc 4999 
Arg Val Ala Lys Pro Asn Lys Phe Gin Phe Leu Val Met Asp Leu Pro 
1640 1645 1650 

atg gtg atg ggc gac cgc etc cat tge atg gat gtt etc ttt get ttc • 5047 
Met Val Met Gly Asp Arg Leu His Cys Met Asp Val Leu Phe Ala Phe 
1655 1660 1665 

act acc agg gtc etc ggg gac tec age ggc ttg gat ace atg aaa ace 5095 
Thr Thr Arg val Leu Gly Asp Ser ser Gly Leu Asp Thr Met Lys Thr 
1670 1675 1680 1685 

atg atg gag gag aag ttt atg gag gcc aac cct ttt aag aag etc tac 5143 
Met Met Glu Glu Lys Phe Met Glu Ala Asn Pro Phe Lys Lys Leu Tyr 
1690 1695 1700 

gag ccc ata gtc acc acc ace aag agg aag gag gag gag caa ggc gcc 5191 
Glu Pro lie val Thr Thr Thr Lys Arg Lys Glu Glu Glu Gin Gly Ala 
1705 1710 1715 

gcc gtc ate cag agg gcc tac egg aaa cac atg gag aag atg gtc aaa 5239 
Ala Val He Gin Arg Ala Tyr Arg Lys His Met Glu Lys Met Val Lys 
1720 1725 1730 

ctg agg ctg aag gac agg tea agt tea teg cac cag gtg ttt tge aat 5287 
Leu Arg Leu Lys Asp Arg Ser Ser Ser Ser His Gin Val Phe Cys Asn 
1735 1740 1745 

gga gac ttg tec age ttg gat gtg gcc aag gtc aag gtt cac aat gac 5335 
Gly Asp Leu Ser Ser Leu Asp Val Ala Lys Val Lys Val His Asn Asp 



1750 


1755 1760 




1765 




tgaaccctca 


tctccaccce 


taceteactg ceteaeaget 


tagectceag 


cetctggega 


5395 


geaggcggca 


gactcactga 


acacaggecg ttegatctgt 


gtttttggct 


gaaegaggtg 


5455 


acaggttgge 


gtceattttt 


aaatgactet tggaaagatt 


tcatgtagag 


agatgttaga 


5515 


agggactgca 


aaggaeaccg 


accataaegg aaggcctgga 


ggacagtcea 


aettaeataa 


5575 


agatgagaaa 


caagaaggaa 


agatcccagg aaaaettcag 


attgtgttct 


cagtacattc 


5635 


eccaatgtgt 


etgtteggtg 


ttttgagtat gtgacctgee 


acatgtagct 


cttttttgca 


5695 


tgtacgtcaa 


aaccctgcag 


taagttaata gcttgctaeg 


ggtgttccta 


ccagcatcac 


5755 


agaattgggt 


gtatgaetca 


aacctaaaag catgaetetg 


acttgtcagt 


cagcaeeceg 


5815 


actttcagac 


gctccaatct 


etgteccagg tgtctaaega 


ataaataggt 


aaaagaaaaa 


5875 



<210> 2 
<211> 1765 
<212> PRT 

<213> Rattus norvegieus 
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<400> 2 

Met Glu Glu Arg Tyr Tyr Pro Val lie Phe Pro Asp Glu Arg Asn Phe 
IS 10 15 

Arg Pro Phe Thr Ser Asp Ser Leu Ala Ala lie Glu Lys Arg lie Ala 
20 25 30 

lie Gin Lys Glu Arg Lys Lys Ser Lys Asp Lys Ala Ala Ala Glu Pro 
35 40 45 

Gin Pro Arg Pro Gin Leu Asp Leu Lys Ala Ser Arg Lys Leu Pro Lys 
50 55 60 

Leu Tyr Gly Asp He Pro Pro Glu Leu Val Ala Lys Pro Leu Glu Asp 
65 70 75 80 

Leu Asp Pro Phe Tyr Lys Asp His Lys Thr Phe Met Val Leu Asn Lys 
85 90 95 

Lys Arg Thr He Tyr Arg Phe Ser Ala Lys Arg Ala Leu Phe He Leu 
100 105 110 

Gly Pro Phe Asn Pro Leu Arg Ser Leu Met He Arg He Ser Val His 
115 X20 125 

Ser Val Phe Ser Met Phe He He Cys Thr Val He He Asn Cys Met 
130 135 140 

Phe Met Ala Asn Ser Met Glu Arg Ser Phe Asp Asn Asp He Pro Glu 
145 150 155 160 

Tyr Val Phe He Gly He Tyr He Leu Glu Ala Val He Lys He Leu 
165 170 175 

Ala Arg Gly Phe He Val Asp Glu Phe Ser Phe Leu Arg Asp Pro Txp 
180 185 190 

Asn Trp Leu Asp Phe He Val He Gly Thr Ala He Ala Thr Cys Phe 
195 200 205 

Pro Gly Ser Gin Val Asn Leu Ser Ala Leu Arg Thr Phe Arg Val Phe 
210 215 220 

Arg Ala Leu Lys Ala He Ser Val He Ser Gly Leu Lys Val He Val 
225 230 235 240 

Gly Ala Leu Leu Arg Ser Val Lys Lys Leu Val Asp Val Met Val Leu 
245 250 255 

Thr Leu Phe Cys Leu Ser He Phe Ala Leu Val Gly Gin Gin Leu Phe 
260 265 270 

Met Gly He Leu Asn Gin Lys Cys He Lys His Asn Cys Gly Pro Asn 
275 280 285 

Pro Ala Ser Asn Lys Asp Cys Phe Glu Lys Glu Lys Asp Ser Glu Asp 
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290 



295 



300 



Phe lie Met Cya Qly Thr Trp Leu Gly Ser Arg Pro Cys Pro Asn Gly 
305 310 315 320 

Ser Thr Cys Asp Lys Thr Thr Leu Asn Pro Asp Asn Asn Tyr Thr Lys 
325 330 335 

Phe Asp Asn Phe Gly Trp Ser Phe Leu Ala Met Phe Arg Val Met Thr 
340 345 350 

Gin Asp Ser Trp Glu Arg Leu Tyr Arg Gin He Leu Arg Thr Ser Gly 
355 360 365 

lie Tyr Phe Val Phe Phe Phe Val Val Val He Phe Leu Gly Ser Phe 
370 375 380- 

Tyr Leu Leu Asn Leu Thr Leu Ala Val Val Thr Met Ala Tyr Glu Glu 
385 390 395 400 

Gin Asn Arg Asn Val Ala Ala Glu Thr Glu Ala Lys Glu Lys Met Phe 
405 410 415 

Gin Glu Ala Gin Gin Leu Leu Arg Glu Glu Lys Glu Ala Leu Val Ala 
420 425 430 

Met Gly lie Asp Arg Ser Ser Leu Asn Ser Leu Gin Ala Ser Ser Phe 
435 440 445 

Ser Pro Lys Lys Arg Lys Phe Phe Gly Ser Lys Thr Arg Lys Ser Phe 
450 455 460 

Phe Met Arg Gly Ser Lys Thr Ala Gin Ala Ser Ala Ser Asp Ser Glu 
465 470 475 480 

Asp Asp Ala Ser Lys Asn Pro Gin Leu Leu Glu Gin Thr Lys Arg Leu 
485 490 495 

Ser Gin Asn Leu Pro Val Asp Leu Phe Asp Glu His Val Asp Pro Leu 
500 505 510 

His Arg Gin Arg Ala Leu Ser Ala Val Ser He Leu Thr He Thr Met 
515 520 525 

Gin Glu Gin Glu Lys Phe Gin Glu Pro Cys Phe Pro Cys Gly Lys Asn 
530. 535 540 

Leu Ala Ser Lys Tyr Leu Val Trp Asp Cys Ser Pro Gin Trp Leu Cys 
545 550 555 560 

He Lys Lys Val Leu Arg Thr He Met Thr Asp Pro Phe Thr Glu Leu 
565 570 575 



Ala He Thr He Cys He He He Asn Thr Val Phe Leu Ala Val Glu 
580 585 590 
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His His Asn Met Asp Asp Asn Leu Lys Thr lie Leu Lys He Gly Asn 
"5 600 605 

Trp Val Phe Thr Gly He Phe He Ala Glu Met Cys Leu Lys He He 

615 620 

Ala Leu Asp Pro Tyr His Tyr Phe Arg His Gly Trp Asn Val Phe Asp 
«30 

Ser He Val Ala Leu Leu Ser Leu Ala Asp Val Xaa Tyr Asn Thr Leu 
645 650 

Ser Asp Asn Asn Arg Ser Phe Leu Ala Ser Leu Arg Val Leu Arg Val 
660 665 670 

Phe Lys Leu Ala Lys Ser Trp Pro Thr Leu Asn Thr Leu He Lys He 
^■^5 680 685 

He Gly His Ser Val Gly Ala Leu Gly Asn Leu Thr Val Val Leu Thr 

695 700 

He Val Val Phe He Phe Ser Val Val Gly Met Arg Leu Phe Gly Thr 
'10 715 720 

Lys Phe Asn Lys Thr Ala Tyr Ala Thr Gin Glu Arg Pro Arg Arg Arg 
'25 730 735 

Trp His Met Asp Asn Phe Tyr His Ser Phe Leu Val Val Phe Arg He 
'40 745 750 

Leu Cys Gly Glu Trp He Glu Asn Met Trp Gly Cys Met Gin Asp Met 
755 760 765 

Asp Gly Ser Pro Leu Cys He He Val Phe Val Leu He Met Val He 

775 780 

Gly Lys Leu Val Val Leu Asn Leu Phe He Ala Leu Leu Leu Asn Ser 
'90 795 800 

Phe Ser Aan Glu Glu Lys Asp Gly Ser Leu Glu Gly Glu Thr Arg Lys 
805 810 815 

Thr Lys Val Gin Leu Ala Leu Asp Arg Phe Arg Arg Ala Phe Ser Phe 
820 825 830 

Met Leu His Ala Leu Gin Ser Phe Cys Cys Lys Lys Cys Arg Arg Lys 
835 840 845 

Asn Ser Pro Lys Pro Lys Glu Thr Thr Glu Ser Phe Ala Gly Glu Asn 
850 855 860 

Lys Asp ser He Leu Pro Asp Ala Arg Pro Tip Lys Glu Tyr Asp Thr 
8'0 875 880 

Asp Met Ala Leu Tyr Thr Gly Gin Ala Gly Ala Pro Leu Ala Pro Leu 
885 890 895 
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Ala Glu Val Glu Asp Asp Val Glu Tyr Cys Gly Glu Gly Gly Ala Leu 
900 905 910 

Pro Thr ser Gin His Ser Ala Gly Val Gin Ala Gly Asp Leu Pro Pro 
515 920 925 

Glu Thr Lys Gin Leu Thr Ser Pro Asp Asp Gin Gly Val Glu Met Glu 
930 935 940 

Val Phe Ser Glu Glu Asp Leu His Leu Ser He Gin Ser Pro Arg Lys 
950 955 960 

Lys Ser Asp Ala Val Ser Met Leu Ser Glu Cys Ser Thr lie Asp Leu 
965 970 975 

Asn Asp He Phe Arg Asn Leu Gin Lys Thr Val Ser Pro Lys Lys Gin 
980 985 990 

Pro Asp Arg Cys Phe Pro Lys Gly Leu Ser Cys His Phe Leu Cys His 
995 1000 1005 

Lys Thr Asp Lys Arg Lys Ser Pro Trp Val Leu Trp Trp Asn He Arg 
1010 1015 1020 

Lys Thr Cys Tyr Gin He Val Lys His Ser Trp Phe Glu Ser Phe He 
1025 1030 1035 1040 

He Phe Val He Leu Leu Ser Ser Gly Ala Leu He Phe Glu Asp Val 
1045 1050 1055 

Asn Leu Pro Ser Arg Pro Gin Val Glu Lys Leu Leu Arg Cys Thr Asp 
1060 1065 1070 

Asn He Phe Thr Phe He Phe Leu Leu Glu Met He Leu Lys Trp Val 
1075 1080 1085 

Ala Phe Gly Phe Arg Arg Tyr Phe Thr Ser Ala Trp Cys Trp Leu Asp 
1090 1095 1100 

Phe Leu He Val Val Val Ser Val Leu Ser Leu Met Asn Leu Pro Ser 
1105 1110 1115 1120 

Leu Lys Ser Phe Arg Thr Leu Arg Ala Leu Arg Pro Leu Arg Ala Leu 
1125 1130 ' 1135 

Ser Gin Phe Glu Gly Met Lys Val Val Val Tyr Ala Leu He Ser Ala 
1140 1145 1150 

He Pro Ala He Leu Asn Val Leu Leu Val Cys Leu He Phe Trp Leu 
1155 1160 1165 

Val Phe Cys He Leu Gly Val Asn Leu Phe Ser Gly Lys Phe Gly Arg 
1170 1175 1180 

Cys He Asn Gly Thr Asp He Asn Met Tyr Leu Tlsp Phe Thr Glu Val 
1185 1190 1195 1200 
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Pro Asn Arg Ser Gin Cys Asn He Ser Asn Tyr Ser Trp Lys Val Pro 
1205 1210 1215 

Gin Val Asn Phe Asp Asn Val Gly Asn Ala Tyr Leu Ala Leu Leu Gin 
1220 1225 1230 

Val Ala Thr Tyr Lys Gly Trp Leu Glu He Met Asn Ala Ala Val Asp 
1235 1240 1245 

Ser Arg Glu Lys Asp Glu Gin Pro Asp Phe Glu Ala Asn Leu Tyr Ala 
1250 1255 1260 

Tyr Leu Tyr Phe Val Val Phe He He Phe Gly Ser Phe Phe Thr Leu 
1265 1270 1275 i280 

Asn Leu Phe He Gly Val He He Asp Asn Phe Asn Gin Gin Gin Lys 
1285 1290 1295 

Lys Leu Gly Gly Gin Asp He Phe Met Thr Glu Glu Gin Lys Lys Tyr 
1300 1305 1310 

Tyr Asn Ala Met Lys Lys Leu Gly Thr Lys Lys Pro Gin Lys Pro He 
1315 1320 1325 

Pro Arg Pro Leu Asn Xaa Cys Gin Ala Phe Val Phe Asp Leu Val Thr 
1330 1335 1340 

Ser His Val Phe Asp Val He He Leu Gly Leu He Val Leu Asn Met 
1350 1355 1360 

He He Met Met Ala Glu Ser Ala Asp Gin Pro Lys Asp Val Lys Lys 
1365 1370 1375 

Thr Phe Asp He Leu Asn He Ala Phe Val Val He Phe Thr He Glu 
1380 1385 1390 

Cys Leu He Lys Val Phe Ala Leu Arg Gin His Tyr Phe Thr Asn Gly 
1395 1400 1405 

Trp Asn Leu Phe Asp Cys Val Val Val Val Leu Ser He He Ser Thr 
1410 1415 1420 

Leu Val Ser Arg Leu Glu Asp Ser Asp He Ser Phe Pro Pro Thr Leu 
1425 1430 1435 1440 

Phe Arg Val Val Arg Leu Ala Arg He Gly Arg He Leu Arg Leu Val 
1445 1450 1455 

Arg Ala Ala Arg Gly He Arg Thr Leu Leu Phe Ala Leu Met Met Ser 
1460 1465 1470 

Leu Pro Ser Leu Phe Asn He Gly Leu Leu Leu Phe Leu Val Met Phe 
1475 1480 1485 

He Tyr Ala He Phe Gly Met Ser Trp Phe Ser Lys Val Lys Lys Gly 
1490 1495 1500 



14 



wo 01/05831 



PCTAJSOO/19342 



Ser Gly lie Asp Asp lie Phe Asn Phe Glu Thr Phe Thr Gly Ser Met 
1505 1510 1515 1520 

Leu Cys Leu Phe Gin He Thr Thr Ser Ala Gly Trp Asp Thr Leu Leu 
1525 1530 1535 

Asn Pro Met Leu Glu Ala Lys Glu His Cys Asn Ser Ser Ser Gin Asp 
1540 1545 1550 

Ser Cys Gin Gin Pro Gin He Ala Val Val Tyr Phe Val Ser Tyr He 
1555 1560 1565 

He He Ser Phe Leu He Val Val Asn Met Tyr He Ala Val He Leu 
1570 1575 1580 

Glu Asn Phe Asn Thr Ala Thr Glu Glu Ser Glu Asp Pro Leu Gly Glu 
1585 1590 1595 1600 

Asp Asp Phe Glu He Phe Tyr Glu Val Trp Glu Lys Phe Asp Pro Glu 
1605 1610 1615 

Ala Ser Gin Phe He Gin Tyr Ser Ala Leu Ser Asp Phe Ala Asp Ala 
1620 1625 1630 

Leu Pro Glu Pro Leu Arg Val Ala Lys Pro Asn Lys Phe Gin Phe Leu 
1635 1640 1645 

Val Met Asp Leu Pro Met Val Met Gly Asp Arg Leu His Cys Met Asp 
1650 1655 1660 

Val Leu Phe Ala Phe Thr Thr Arg Val Leu Gly Asp Ser Ser Gly Leu 
1665 1670 1675 1680 

Asp Thr Met Lys Thr Met Met Glu Glu Lys Phe Met Glu Ala Asn Pro 
1685 1690 1695 

Phe Lys Lys Leu Tyr Glu Pro He Val Thr Thr Thr Lys Arg Lys Glu 
1700 1705 1710 

Glu Glu Gin Gly Ala Ala Val He Gin Arg Ala Tyr Arg Lys His Met 
1715 1720 1725 

Glu Lys Met Val Lys Leu Arg Leu Lys Asp Arg Ser Ser Ser Ser His 
1730 1735 1740 

Gin Val Phe Cys Asn Gly Asp Leu Ser Ser Leu Asp Val Ala Lys Val 
1745 1750 1755 1760 

Lys Val His Asn Asp 
1765 



<210> 3 
<211> 1765 
<212> PRT 
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<213> Rattus norvegicus 
<220> 

<223> putative amino acid seq. of rat NaN 
<400> 3 

Met Glu Glu Arg Tyr Tyr Pro Val lie Phe Pro Asp Glu Arg Asn Phe 
15 10 15 

Arg Pro Phe Thr Ser Asp Ser Leu Ala Ala lie Glu Lys Arg lie Ala 
20 25 30 

He Gin Lys Glu Arg Lys Lys Ser Lys Asp Lys Ala Ala Ala Glu Pro 
35 40 45 

Gin Pro Arg Pro Gin Leu Asp Leu Lys Ala Ser Arg Lys Leu Pro Lys 
50 55 60 

Leu Tyr Gly Asp lie Pro Pro Glu Leu Val Ala Lys Pro Leu Glu Asp 
65 70 75 80 

Leu Asp Pro Phe Tyr Lys Asp His Lys Thr Phe Met Val Leu Asn Lys 
85 90 95 

Lys Arg Thr lie Tyr Arg Phe Ser Ala Lys Arg Ala Leu Phe He Leu 
100 105 110 

Gly Pro Phe Asn Pro Leu Arg Ser Leu Net He Arg He Ser Val His 
115 120 125 

Ser Val Phe Ser Met Phe He He Cys Thr Val He He Asn Cys Met 
130 135 140 

Phe Met Ala Asn Ser Met Glu Arg Ser Phe Asp Asn Asp He Pro Glu 
145 ISO 155 160 

Tyr Val Phe He Gly He Tyr He Leu Glu Ala Val He Lys He Leu 
165 170 175 

Ala Arg Gly Phe He Val Asp Glu Phe Ser Phe Leu Arg Asp Pro Trp 
180 185 190 

Asn Trp Leu Asp Phe He Val He Gly Thr Ala He Ala Thr Cys Phe 
195 200 205 

Pro Gly Ser Gin Val Asn Leu Ser Ala Leu Arg Thr Phe Arg Val Phe 
210 215 220 

Arg Ala Leu Lys Ala He Ser Val He Ser Gly Leu Lys Val He Val 
225 230 235 . 240 

Gly Ala Leu Leu Arg Ser Val Lys Lys Leu Val Asp Val Met Val Leu 
245 250 255 

Thr Leu Phe Cys Leu Ser He Phe Ala Leu Val Gly Gin Gin Leu Phe 
260 265 270 



16 



wo 01/05831 



PCT/USOO/19342 



Met Gly He Leu Asn Gin Lys Cys He Lys His Asn Cys Gly Pro Ash 
275 280 285 

Pro Ala Ser Asn Lys Asp Cys Phe Glu Lys Glu Lys Asp Ser Glu Asp 
290 295 300 

Phe He Met Cys Gly Thr Trp Leu Gly Ser Arg Pro Cys Pro Asn Gly 
310 315 320 

Ser Thr Cys Asp Lys Thr Thr Leu Asn Pro Asp Asn Asn Tyr Thr Lys 
325 330 335 

Phe Asp Asn Phe Gly Trp Ser Phe Leu Ala Met Phe Arg Val Met Thr 
340 345 350 

Gin Asp Ser Trp Glu Arg Leu Tyr Arg Gin He Leu Arg Thr Ser Gly 
355 360 365 

He Tyr Phe Val Phe Phe Phe Val Val Val He Phe Leu Gly Ser Phe 
370 375 380 

Tyr Leu Leu Asn Leu Thr Leu Ala Val Val Thr Met Ala Tyr Glu Glu 
390 395 400 

Gin Asn Arg Asn Val Ala Ala Glu Thr Glu Ala Lys Glu Lys Met Phe 
405 410 415 

Gin Glu Ala Gin Gin Leu Leu Arg Glu Glu Lys Glu Ala Leu Val Ala 
420 425 430 

Met Gly He Asp Arg Ser Ser Leu Asn Ser Leu Gin Ala Ser Ser Phe 
435 440 445 

Ser Pro Lys Lys Arg Lys Phe Phe Gly Ser Lys Thr Arg Lys Ser Phe 
450 455 460 

Phe Met Arg Gly Ser Lys Thr Ala Gin Ala Ser Ala Ser Asp Ser Glu 
470 475 480 

Asp Asp Ala Ser Lys Asn Pro Gin Leu Leu Glu Gin Thr Lys Arg Leu 
485 490 495 

Ser Gin Asn Leu Pro Val Asp Leu Phe Asp Glu His Val Asp Pro Leu 
500 505 510 

His Arg Gin Arg Ala Leu Ser Ala Val Ser He Leu Thr He Thr Met 
515 520 525 

Gin Glu Gin Glu Lys Phe Gin Glu Pro Cys Phe Pro Cys Gly Lys Asn 
530 535 540 



Leu Ala Ser Lys Tyr Leu Val Trp Asp Cys Ser Pro Gin Trp Leu Cys 
545 550 555 560 

He Lys Lys Val Leu Arg Thr He Met Thr Asp Pro Phe Thr Glu Leu 
565 570 575 
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Ala lie Thr lie Cys lie lie lie Asn Thr Val Phe Leu Ala Val Glu 
580 585 590 

His His Asn Met Asp Asp Asn Leu Lys Thr lie Leu Lys lie Gly Asn 
595 600 605 

Trp Val Phe Thr Gly lie Phe He Ala Glu Met Cys Leu Lys He He 
610 615 620 

Ala Leu Asp Pro Tyr His Tyr Phe Arg His Gly Trp Asn Val Phe Asp 
625 630 635 640 

Ser He Val Ala Leu Leu Ser Leu Ala Asp Val Leu Tyr Asn Thr Leu 
645 650 655 

Ser Asp Asn Asn Arg Ser Phe Leu Ala Ser Leu Arg Val Leu Arg Val 
660 665 670 

Phe Lys Leu Ala Lys Ser Trp Pro Thr Leu Asn Thr Leu He Lys He 
675 680 685 

He Gly His Ser Val Gly Ala Leu Gly Asn Leu Thr Val Val Leu Thr 
690 695 700 

He Val Val Phe He Phe Ser Val Val Oly Met Arg Leu Phe Gly Thr 

705 710 715 720 

Lys Phe Asn Lys Thr Ala Tyr Ala Thr Gin Glu Arg Pro Arg Arg Arg 
725 730 735 

Trp His Met Asp Asn Phe Tyr His Ser Phe Leu Val Val Phe Arg He 
740 745 750 

Leu Cys Gly Glu Trp He Glu Asn Met Trp Gly Cys Met Gin Asp Met 
755 760 765 

Asp Gly Ser Pro Leu Cys He He Val Phe Val Leu He Met Val He 
770 775 780 

Gly Lys Leu Val Val Leu Asn Leu Phe He Ala Leu Leu Leu Asn Ser 
785 790 795 800 

Phe Ser Asn Glu Glu Lys Asp Gly Ser Leu Glu Gly Glu Thr Arg Lys 
805 810 815 

Thr Lys Val Gin Leu Ala Leu Asp Arg Phe Arg Arg Ala Phe Ser Phe 
820 825 830 

Met Leu His Ala Leu Gin Ser Phe Cys Cys Lys Lys Cys Arg Arg Lys 
835 840 845 

Asn Ser Pro Lys Pro Lys Glu Thr Thr Glu Ser Phe Ala Gly Glu Asn 
850 855 860 

Lys Asp Ser He Leu Pro Asp Ala Arg Pro Trp Lys Glu Tyr Asp Thr 
865 870 875 880 
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Asp Met Ala Leu Tyr Thr Gly Gin Ala Gly Ala Pro Leu Ala Pro Leu 
665 890 695 

Ala Glu Val Glu Asp Asp Val Glu Tyr Cys Gly Glu Gly Gly Ala Leu 
900 905 910 

Pro Thr Ser Gin His Ser Ala Gly Val Gin Ala Gly Asp Leu Pro Pro 
915 920 925 

Glu Thr Lys Gin Leu Thr Ser Pro Asp Asp Gin Gly Val Glu Met Glu 
930 935 940 

Val Phe Ser Glu Glu Asp Leu His Leu Ser lie Gin Ser Pro Arg Lys 
945 950 955 960 

Lys Ser Asp Ala Val Ser Met Leu Ser Glu Cys Ser Thr lie Asp Leu 
965 970 975 

Asn Asp lie Phe Arg Asn Leu Gin Lys Thr Val Ser Pro Lys Lys Gin 
980 985 990 

Pro Asp Arg Cys Phe Pro Lys Gly Leu Ser Cys His Phe Leu Cys His 
995 lOOO 1005 

Lys Thr Asp Lys Arg Lys Ser Pro Trp Val Leu Trp Trp Asn lie Arg 
1010 1015 1020 

Lys Thr Cys Tyr Gin lie Val Lys His Ser Trp Phe Glu Ser Phe lie 
1025 1030 1035 1040 

He Phe Val He Leu Leu Ser Ser Gly Ala Leu He Phe Glu Asp Val 
1045 1050 1055 

Asn Leu Pro Ser Arg Pro Gin Val Glu Lys Leu Leu Arg Cys Thr Asp 
1060 1065 1070 

Asn He Phe Thr Phe He Phe Leu Leu Glu Met He Leu Lys Trp Val 
1075 1080 1085 

Ala Phe Gly Phe Arg Arg Tyr Phe Thr Ser Ala Trp Cys Trp Leu Asp 
1090 1095 1100 

Phe Leu He Val Val Val Ser Val Leu Ser Leu Met Asn Leu Pro Ser 
1105 1110 1115 1120 

Leu Lys Ser Phe Arg Thr Leu Arg Ala Leu Arg Pro Leu Arg Ala Leu 
1125 1130 1135 

Ser Gin Phe Glu Gly Met Lys Val Val Val Tyr Ala Leu He Ser Ala 
1140 1145 1150 

He Pro Ala He Leu Asn Val Leu Leu val Cys Leu He Phe Trp Leu 
1155 1160 1165 

Val Phe Cys He Leu Gly Val Asn Leu Phe Ser Gly Lys Phe Gly Arg 
1170 1175 1180 
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Cys lie Asn Gly Thr Asp He Asn Met Tyr Leu Asp Phe Thr Qlu Val 
1185 1190 1195 1200 

Pro Asn Arg Ser Gin Cys Asn He Ser Asn Tyr Ser Trp Lys Val Pro 
1205 1210 1215 

Gin Val Asn Phe Asp Asn Val Gly Asn Ala Tyr Leu Ala Leu Leu Gin 
1220 1225 1230 

Val Ala Thr Tyr Lys Gly Trp Leu Glu He Met Asn Ala Ala Val Asp 
1235 1240 1245 

Ser Arg Glu Lys Asp Glu Gin Pro Asp Phe Glu Ala Asn Leu Tyr Ala 

1250 1255 1260 

Tyr Leu Tyr Phe Val Val Phe He He Phe Gly Ser Phe Phe Thr Leu 
1265 1270 1275 1280 

Asn Leu Phe He Gly Val He He Asp Asn Phe Asn Gin Gin Gin Lys 
1285 1290 1295 

Lys Leu Gly Gly Gin Asp He Phe Met Thr Glu Glu Gin Lys Lys Tyr 
1300 1305 1310 

Tyr Asn Ala Met Lys Lys Leu Gly Thr Lys Lys Pro Gin Lys Pro He 
1315 1320 1325 

Pro Arg Pro Leu Asn Arg Cys Gin Ala Phe Val Phe Asp Leu Val Thr 
1330 1335 1340 

Ser His Val Phe Asp Val He He Leu Gly Leu He Val Leu Asn Met 
1345 1350 1355 1360 

He He Met Met Ala Glu Ser Ala Asp Gin Pro Lys Asp Val Lys Lys 
1365 1370 1375 

Thr Phe Asp He Leu Asn He Ala Phe Val Val He Phe Thr He Glu 
1380 1385 1390 

Cys Leu He Lys Val Phe Ala Leu Arg Gin His Tyr Phe Thr Asn Gly 
1395 1400 1405 

Trp Asn Leu Phe Asp Cys Val Val Val Val Leu Ser He He Ser Thr 
1410 1415 1420 

Leu Val Ser Arg Leu Glu Asp Ser Asp He Ser Phe Pro Pro Thr Leu 
1425 1430 1435 1440 

Phe Arg Val Val Arg Leu Ala Arg He Gly Arg He Leu Arg Leu Val 
1445 1450 1455 

Arg Ala Ala Arg Gly He Arg Thr Leu Leu Phe Ala Leu Met Met Ser 
1460 1465 1470 

Leu Pro Ser Leu Phe Asn He Gly Leu Leu Leu Phe Leu Val Met Phe 
1475 1480 1485 
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lie Tyr Ala lie Phe Gly Met Ser Trp Phe Ser Lys Val Lys Lys Gly 
1490 1495 1500 

Ser Gly lie Asp Asp lie Phe Asn Phe Glu Thr Phe Thr Gly Ser Met 
1505 1510 1515 1520 

lieu Cys Leu Phe Gin lie Thr Thr Ser Ala Gly Trp Asp Thr Leu Leu 
1525 1530 1535 

Asn Pro Met Leu Glu Ala Lys Glu His Cys Asn Ser Ser Ser Gin Asp 
1540 1545 1550 

Ser Cys Gin Gin Pro Gin lie Ala Val Val Tyr Phe Val Ser Tyr lie 
1555 1560 1565 

He He Ser Phe Leu He Val Val Asn Met Tyr He Ala Val He Leu 
1570 1575 1580 

Glu Asn Phe Asn Thr Ala Thr Glu Glu Ser Glu Asp Pro Leu Gly Glu 
1585 1590 1595 1600 

Asp Asp Phe Glu He Phe Tyr Glu Val Trp Glu Lys Phe Asp Pro Glu 
1605 1610 1615 

Ala Ser Gin Phe He Gin Tyr Ser Ala Leui Ser Asp Phe Ala Asp Ala 
1620 1625 1630 

Leu Pro Glu Pro Leu Arg Val Ala Lys Pro Asn Lys Phe Gin Phe Leu 
1635 1640 1645 

Val Met Asp Leu Pro Met Val Met Gly Asp Arg Leu His Cys Met Asp 
1650 1655 1660 

Val Leu Phe Ala Phe Thr Thr Arg Val Leu Gly Asp Ser Ser Gly Leu 
1665 1670 1675 1680 

Asp Thr Met Lys Thr Met Met Glu Glu Lys Phe Met Glu Ala Asn Pro 
1685 1690 1695 

Phe Lys Lys Leu Tyr Glu Pro He Val Thr Thr Thr Lys Arg Lys Glu 
1700 1705 1710 

Glu Glu Gin Gly Ala Ala Val He Gin Arg Ala Tyr Arg Lys His Met 
1715 1720 1725 

Glu Lys Met Val Lys Leu Arg Leu Lys Asp Arg Ser Ser Ser Ser His 
1730 1735 1740 

Gin Val Phe Cys Asn Gly Asp Leu Ser Ser Leu Asp Val Ala Lys Val 
1745 1750 1755 1760 

Lys Val His Asn Asp 
1765 



<210> 4 
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<211> 5822 

<23.2> DNA 

<213> Mus musculus 

<220> 

<221> CDS 

<222> (19) . . (5313) 

<220> 

<221> iinsure 
<222> (5804) 

<223> cDNA sequence of mouse NaH, n » a or c or g or t 

<400> 4 

tctgagccaa gggtgaag atg gag gag agg tac tat cca gtg ate ttc cca 51 
Met Glu Glu Arg Tyr Tyr Pro Val He Phe Pro 
15 10 

gac gag agg aat ttc cgc ccc ttc act ttc gac tct ttg get gca ata 99 
Asp Glu Arg Asn Phe Arg Pro Phe Thr Phe Asp Ser Leu Ala Ala He 
15 20 25 

gag aag egg ate acc ate caa aag gag aag aag aaa tec aaa gac aag 147 
Glu Lys Arg He Thr He Gin hya Glu Lys Lys Lys Ser Lys Asp Lys 
30 35 40 

gca gca act gag ccc cag cet egg cct cag etc gac eta aag gee tec 195 
Ala Ala Thr Glu Pro Gin Pro Arg Pro Gin Leu Asp Leu Lys Ala Ser 
45 50 55 

agg aag tta cct aag etc tat ggc gac gtt ccc cct gac ctt ata gcg 243 
Arg Lys Leu Pro Lys Leu Tyr Gly Asp Val Pro Pro Asp Leu He Ala 
60 65 70 75 

aag ccc ctg gaa gat ctg gac cca ttt tac aaa gac cat aag aca ttc 291 
Lys Pro Leu Glu Asp Leu Asp Pro Phe Tyr Lys Asp His Lys Thr Phe 
80 85 90 

atg gta ttg aac aag aag aga aca ate tat cgc ttc age gee aag agg 339 
Met Val Leu Asn Lys Lys Arg Thr He Tyr Arg Phe Ser Ala Lys Arg 
95 100 105 

gee ttg ttc att ctg ggg cct ttt aat ccc ate aga age ttc atg att 387 
Ala Leu Phe He Leu Gly Pro Phe Asn Pro He Arg Ser Phe Met He 

110 115 120 

cgc ate tct gtc cat tea gtc ttc age atg ttc att ate tgc aca gtg 435 
Arg He Ser Val His Ser Val Phe Ser Met Phe He He Cys Thr Val 
125 130 135 

ate ate aac tgt atg ttc atg get aat aat tct tct gtg gac agt cgt 483 
He He Asn Cys Met Phe Met Ala Asn Asn Ser Ser Val Asp Ser Arg 
140 145 150 155 

cct age agt aac att ccc gaa tac gtc ttc att ggg att tat gtt tta 531 
Pro Ser Ser Asn He Pro Glu Tyr Val Phe He Gly He Tyr Val Leu 
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160 165 170 

gaa get gtg att aaa ata ttg gca aga ggc ttc att gtg gat gag ttt 579 
Glu Ala Val lie Lys lie Leu Ala Arg Gly Phe He Val Asp Glu Phe 
175 . 180 185 

tec tac etc ega gat cet tgg aac tgg ctg gac tte att gte ate gga 627 
Ser Tyr Leu Arg Asp Pro Trp Asn Trp Leu Asp Phe He Val He Gly 
190 195 200 

aca gcg ata gcg cct tgt ttt etc ggt aac aaa gte aat aat ctt tec 675 
Thr Ala lie Ala Pro Cys Phe Leu Gly Asn Lys Val Asn Asn Leu Ser 
205 210 215 

act eta cgt acc ttc cga gtg ttg aga get ctg aaa gcc att tct gta 723 
Thr Leu Arg Thr Phe Arg Val Leu Arg Ala Leu Lys Ala He Ser Val 
220 225 230 235 

ate tea ggt ctg aag gte ate gtg ggt gee ctg ctg cgc tec gtg aag 771 
He Ser Gly Leu Lys Val He Val Gly Ala Leu Leu Arg Ser Val Lys 
240 245 250 

aag eta gtg gac gtg atg gte etc act etc ttt tgc etc age ate ttt 819 
Lys Leu Val Asp Val Met Val Leu Thr Leu Phe Cys Leu Ser He Phe 
255 260 265 

gcc ctg gtt ggt cag cag etc ttc atg gga att ctg age cag aaa tgt 867 
Ala Leu Val Gly Gin Gin Leu Phe Met Gly He Leu Ser Gin Lys Cys 
270 275 280 

att aag gac gac tgt ggc cet aac get ttt tec aac aag gat tgc ttt 915 
He Lys Asp Asp Cys Gly Pro Asn Ala Phe Ser Asn Lys Asp Cys Phe 
285 290 295 

gta aaa gaa aat gat age gag gac ttc ata atg tgt ggc aac tgg etc 963 
Val Lys Glu Asn Asp Ser Glu Asp Phe He Met Cys Gly Asn Trp Leu 
300 305 310 315 

ggc aga aga tec tgc ccc gat ggt tec acg tgc aat aaa acc aca ttt 1011 
Gly Arg Arg Ser Cys Pro Asp Gly Ser Thr Cys Asn Lys Thr Thr Phe 
320 325 330 

aac cea gat tat aat tat aca aac ttt gac age ttt ggc tgg tct ttt 1059 
Asn Pro Asp Tyr Asn Tyr Thr Asn Phe Asp Ser Phe Gly Trp Ser Phe 
335 340 345 

etc gcc atg ttc egg gtt atg act caa gac tec tgg gag aag ctt tat 1107 
Leu Ala Met Phe Arg Val Met Thr Gin Asp Ser Trp Glu Lys Leu Tyr 
350 355 360 

cga cag ate ctt cgc acc tec ggg ate tac ttt gte ttc ttc ttc gtg 1155 
Arg Gin He Leu Arg Thr Ser Gly He Tyr Phe Val Phe Phe Phe Val 
365 370 375 

gte gte ate ttc ctg ggc tct ttc tac ctg ctt aac tta acc ctg get 1203 
Val Val He Phe Leu Gly Ser Phe Tyr Leu Leu Asn Leu Thr Leu Ala 
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380 385 390 395 

gtc gtc acc atg get tac gag gaa cag aac aga aat gtc get gcc gag 1251 
Val Val Thr Met Ala Tyr Glu Glu Gin Asn Arg Asn Val Ala Ala Glu 
400 405 410 

aca gag gcc aag gag aag atg ttt cag gaa gcc cag cag ctg ttg agg 1299 
Thr Glu Ala Lys Glu Lys Met Phe Gin Glu Ala Gin Gin Leu Leu Arg 
415 420 425 

gag gaa aag gag get ctg gtt gcc atg gga att gac aga act tec ctt 1347 
Glu Glu Lys Glu Ala Leu Val Ala Met Gly He Asp Arg Thr Ser Leu 
430 435 440 

aat tec etc caa get teg tee ttt tec cca aag aag agg aag ttt ttt 1395 
Asn Ser Leu Gin Ala Ser Ser Phe Ser Pro Lys Lys Arg Lys Phe Phe 
445 450 455 

ggc agt aag aca aga aag tec ttc ttt atg aga ggg tec aag aca gcc 1443 
Gly Ser Lys Thr Arg Lys Ser Phe Phe Met Arg Gly Ser Lys Thr Ala 
460 465 470 475 

cga gcc tea gcg tec gat tea gag gac gat gcc tct aaa aac cca caa 1491 
Arg Ala Ser Ala Ser Asp Ser Glu Asp Asp Ala Ser Lys Asn Pro Gin 
480 485 490 

etc ctt gag caa aca aaa cga eta tec cag aac ttg ecc gta gaa etc 1539 
Leu Leu Glu Gin Thr Lys Arg Leu Ser Gin Asn Leu Pro Val Glu Leu 
495 500 505 

ttt gat gag cac gtg gac ecc etc cat agg cag aga gcg ctg agt gcc 1587 
Phe Asp Glu His Val Asp Pro Leu His Arg Gin Arg Ala Leu Ser Ala 
510 515 520 

gtc agt ate tta acc ate acc atg cag gaa caa gaa aaa tee cag gag 1635 
Val Ser He Leu Thr He Thr Met Gin Glu Gin Glu Lys Ser Gin Glu 
525 530 535 

cct tgt ttc ecg tgt ggg aaa aac ttg gca tec aag tac ctg gtg tgg 1683 
Pro Cys Phe Pro Cys Gly Lys Asn Leu Ala Ser Lys Tyr Leu Val Trp 
540 545 550 555 

gaa tgt age cct ecg tgg ctg tgc ata aag aag gtc ctg cag act ate 1731 
Glu Cys Ser Pro Pro Trp Leu Cys He Lys Lys Val Leu Gin Thr He 
560 565 570 

atg aca gac ecc ttc act gag ctg gcc ate acc ate tgc ate ate gtc 1779 
Met Thr Asp Pro Phe Thr Glu Leu Ala He Thr He Cys He He Val 
575 580 585 

aat act gtc ttc ttg gcc atg gaa cac cac aat atg gat aac tct tta 1827 
Asn Thr Val Phe Leu Ala Met Glu His His Asn Met Asp Asn Ser Leu 
590 595 600 

aaa gac ata ctg aaa ata gga aac tgg gtt ttc act gga att ttc ata 1875 
Lys Asp He Leu Lys He Gly Asn Trp Val Phe Thr Gly He Phe He 
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605 610 615 

gc9 gaa atg tgt etc aag ate att gcg eta gac ect tac cac tae ttc 1923 
Ala Glu Met Cys Leu Lys He He Ala Leu Asp Pro Tyr Mis Tyr Phe 
620 625 630 635 



egg cac ggc tgg aac ate ttt gac age 
Arg His Gly Trp Asn He Phe Asp Ser 
640 



att gtg gcc ctt gtg agt etc 1971 
He Val Ala Leu Val Ser Leu 
645 650 



get gac gtg etc ttc cac aaa ctg tct aaa aac etc tec ttc ttg get 2019 
Ala Asp Val Leu Phe His Lys Leu Ser Lys Asn Leu Ser Phe Leu Ala 
655 660 665 

tec etc aga gtg ctg agg gtc ttc aag tta gcc aaa tec tgg ccc aca 2067 
Ser Leu Arg Val Leu Arg Val Phe Lys I.eu Ala Lys Ser Trp Pro Thr 
670 675 680 

tta aac act etc att aag ate ate ggc cac tec gtg ggt gcg etc gga 2115 
Leu Asn Thr Leu He Lys He He Gly His Ser Val Gly Ala Leu Gly 
685 690 695 

aac ctg act gtg gtc eta acg ate gtg gtc ttc ate ttt tec gtg gtt 2163 
Asn Leu Thr Val Val Leu Thr He Val Val Phe He Phe Ser Val Val 
700 705 710 715 

ggc atg egg etc ttt ggt gcc aag ttt aac aag act tgc tec ace tct 2211 
Gly Met Arg Leu Phe Gly Ala Lys Phe Asn Lys Thr Cys Ser Thr Ser 
720 725 730 

ccg gag tec etc egg cgc tgg cac atg ggt gat ttc tae cat tec ttc 2259 
Pro Glu Ser Leu Arg Arg Trp His Met Gly Asp Phe Tyr His Ser Phe 
735 740 745 

ctg gtg gtg ttc cgc ate etc tgt ggg gag tgg ate gag aac atg tgg 2307 
Leu Val Val Phe Arg He Leu Cys Gly Glu Trp He Glu Asn Met Trp 
750 755 760 

gaa tgc atg cag gag atg gaa ggc tec ccg ctg tgt gtc ate gtc ttt 2355 
Glu Cys Met Gin Glu Met Glu Gly Ser Pro Leu Cys Val He Val Phe 
765 770 775 

gtg ctg ate atg gtg gtc ggg aag etc gtg gtg ctt aac etc ttc att 2403 
val Leu He Met Val Val Gly Lys Leu Val Val Leu Asn Leu Phe He 
780 785 790 795 

gcc ttg ctg etc aat tee ttc age aat gag gaa aag gat ggg aac cea 2451 
Ala Leu Leu Leu Asn Ser Phe Ser Asn Glu Glu Lys Asp Gly Asn Pro 
800 805 810 

gaa gga gag ace agg aaa ace aaa gtg cag eta gee ctg gat egg ttc 2499 
Glu Gly Glu Thr Arg Lys Thr Lys Val Gin Leu Ala Leu Asp Arg Phe 
815 820 825 

age ega gcg ttc tac ttc atg gcg cgc get ctt cag aat ttc tgt tgc 2547 
Ser Arg Ala Phe Tyr Phe Met Ala Arg Ala Leu Gin Asn Phe Cys Cys 
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830 835 840 

aag aga tgc agg agg caa aac teg cca aag cca aat gag gca aca gaa 2595 
Lys Arg Cys Arg Arg Gin Asn Ser Pro Lys Pro Asn Glu Ala Thr Glu 
845 850 855 

age ttt get ggt gag agt aga gac aca gcc acc ctg gat aca agg tec 2643 
Ser Phe Ala Gly Glu Ser Arg Asp Thr Ala Thr I*eu Asp Thr Arg Ser 
860 865 870 875 

tgg aag gag tat gat tea gaa atg aet etg tae act ggg cag gee ggg 2691 
Trp Lys Glu Tyr Asp Ser Glu Met Thr Leu Tyr Thr Gly Gin Ala Gly 
880 885 890 

get cca ctg gcc cca ctg gca aaa gaa gag gac gat atg gaa tgt tgt 2739 
Ala Pro Leu Ala Pro Leu Ala Lys Glu Glu Asp Asp Met Glu Cys Cys 
895 900 905 

ggt gaa tgt gat gcc tea cet acc tea eag ect agt gag gaa get cag 2787 
Gly Glu Cys Asp Ala Ser Pro Thr Ser Gin Pro Ser Glu Glu Ala Gin 
910 915 920 

gee tgt gae etc ect ctg aag ace aag egg etc eee age cca gat gac 2835 
Ala Cys Asp Leu Pro Leu Lys Thr Lys Arg Leu Pro Ser Pro Asp Asp 
925 930 935 

cac ggg gtt gaa atg gaa gtg ttt tec gaa gaa gat ccg aat tta acc 2883 
His Gly Val Glu Met Glu Val Phe Ser Glu Glu Asp Pro Asn Leu Thr 
940 945 950 955 

ata eag agt get cga aag aag tet gat gcg gca age atg etc tea gaa 2931 
lie Gin Ser Ala Arg Lys Lys Ser Asp Ala Ala Ser Met Leu Ser Glu 
960 965 970 

tgc age aca ata gae etg aat gat ate ttt aga aat tta cag aaa aca 2979 
Cys Ser Thr lie Asp Leu Asn Asp lie Phe Arg Asn Leu Gin Lys Thr 
975 980 985 

gtt tee eee caa aag caa cca gat cga tgc ttt ccc aag ggc etc agt 3027 
Val Ser Pro Gin Lys Gin Pro Asp Arg Cys Phe Pro Lys Gly Leu Ser 
990 995 1000 

tgt ate ttt eta tgt tgc aaa aca ate aaa aaa aag tec eee tgg gtc 3075 
Cys He Phe Leu Cys Cys Lys Thr He Lys Lys Lys Ser Pro Trp Val 
1005 1010 1015 

ctg tgg tgg aat ctt egg aaa ace tgc tae caa ate gtg aag eat age 3123 
Leu Trp Trp Asn Leu Arg Lys Thr Cys Tyr Gin He Val Lys Hi6 Ser 
1020 1025 1030 1035 

tgg ttt gag age ttc ata att ttt gtc ate etg ctg age age gga gca 3171 
Trp Phe Glu Ser Phe He He Phe val He Leu Leu Ser Ser Gly Ala 
1040 1045 1050 

ctg ata ttc gaa gat gtc aat ctt ccc age egg ccc caa gtt gaa aaa 3219 
Leu He Phe Glu Asp Val Asn Leu Pro Ser Arg Pro Gin Val Glu Lys 
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1055 1060 X065 

tta ctg aag tgt acc gat aat att ttc aca ttt att ttt etc ctg gaa 3267 
Leu Leu Lys Cys Thr Asp Asn lie Phe Thr Phe lie Phe Leu Leu Glu 
1070 1075 1080 

atg att ttg aag tgg gtg gcc ttt gga ttc egg aag tat ttc acc agt 3315 
Met He Leu Lys Trp Val Ala Phe Gly Phe Arg Lys Tyr Phe Thr Ser 
1085 1090 1095 

gcc tgg tgc tgg etc gat ttc etc att gtg gtg gtg tct gtg etc age 3363 
Ala Trp Cys Trp Leu Asp Phe Leu He Val Val Val Ser Val Leu Ser 
1100 1105 1110 1115 

etc acg aac tta cca aac ttg aag tec ttc egg aat ctg cga gcg ctg 3411 
Leu Thr Asn Leu Pro Asn Leu Lys Ser Phe Arg Asn Leu Arg Ala Leu 
1120 1125 1130 

aga cct ctg egg gea ctg tct eag ttt gaa gga atg aag gtt gtt gtc 3459 
Arg Pro Leu Arg Ala Leu Ser Gin Phe Glu Gly Met Lys Val Val Val 
' 1135 1140 1145 

aat gcc etc atg agt gcc ata cct gcc ate etc aat gtc ttg ctg gtc 3507 
Asn Ala Leu Met Ser Ala He Pro Ala He Leu Asn Val Leu Leu Val 
1150 1155 1160 

tgc etc att ttc tgg etc ata ttt tgt ate ctg gga gta aat ttt ttt 3555 
Cys Leu He Phe Trp Leu He Phe Cys He Leu Gly Val Asn Phe Phe 
1165 1170 1175 

tct ggg aag ttt gga aga tgc att aat gga aca gac ata aat aaa tat 3603 
Ser Gly Lys Phe Gly Arg Cys He Asn Gly Thr Asp He Asn Lys Tyr 
1180 1185 1190 1195 

ttc aac get tec aat gtt cca aac caa age caa tgt tta gtt agt aat 3651 
Phe Asn Ala Ser Asn Val Pro Asn Gin Ser Gin Cys Leu Val Ser Asn 
1200 1205 1210 

tac acg tgg aaa gtc ccg aat gtc aac ttt gac aac gtg ggg aat gcc 3699 
Tyr Thr Trp Lys Val Pro Asn Val Asn Phe Asp Asn Val Gly Asn Ala 
1215 1220 1225 

tac ctt gcc ctg ctg caa gtg gcg ace tat aag ggc tgg ctg gac att 3747 
Tyr Leu Ala Leu Leu Gin Val Ala Thr Tyr Lys Gly Trp Leu Asp He 
1230 1235 1240 

atg aat gea get gtt gat tec aga ggg aaa gat gag cag ccg gee ttt 3795 
Met Asn Ala Ala Val Asp Ser Arg Gly Lys Asp Glu Gin Pro Ala Phe 
1245 1250 1255 

gag gcg aat eta tac gea tac ctt tac ttc gtg gtt ttt ate ate ttc 3843 
Glu Ala Asn Leu Tyr Ala Tyr Leu Tyr Phe Val Val Phe He He Phe 
1260 1265 1270 1275 

ggc tea ttc ttt acc ctg aac etc ttt ate ggt gtt att att gac aac 3891 
Gly Ser Phe Phe Thr Leu Asn Leu Phe He Gly Val He He Asp Asn 
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1280 1285 1290 

ttc aat cag cag cag aaa aag tta ggt ggc caa gac att ttt atg aca 3939 
Phe Asn Gin Qln Gin Lys Lys Leu Gly Gly Gin Asp lie Phe Met Thr 
1295 1300 1305 

gaa gaa cag aag aaa tat tac aat gca atg aaa aag tta gga acc aag 3987 
Glu Glu Gin Lys Lys Tyr Tyr Asn Ala Met Lys Lys Leu Gly Thr Lys 
1310 1315 1320 

aag cct caa aag ccc ate cca agg ccc ctg aac aaa tgt caa gcc ttc 4035 
Lys Pro Gin Lys Pro lie Pro Arg Pro Leu Asn Lys Cys Gin Ala Phe 
1325 1330 ,1335 

gtg ttc gat ttg gtc aca age eag gtc ttt gac gtc ate att ctg ggt 4083 
Val Phe Asp Leu Val Thr Ser Gin Val Phe Asp Val lie lie Leu Gly 
1340 1345 1350 1355 

ctt att gtc aca aac atg att' ate atg atg get gaa tct gaa ggc cag 4131 
Leu lie Val Thr Asn Met He He Met Met Ala Glu Ser Glu Gly Gin 
1360 1365 1370 

ccc aac gaa gtg aag aaa ate ttt gat att etc aac ata gtc ttc gtg 4179 
Pro Asn Glu Val Lys Lys He Phe Asp lie Leu Asn He Val Phe Val 
1375 1380 1385 

gtc ate ttt ace gta gag tgt etc ate aaa gtc ttt get ttg agg caa 4227 
Val He Phe Thr Val Glu Cys Leu He Lys Val Phe Ala Leu Arg Gin 
1390 1395 1400 

cac tac ttc acc aat ggc tgg aac tta ttt gat tgt gtg gtc gtg gtt 4275 
His Tyr Phe Thr Asn Gly Trp Asn Leu Phe Asp Cys Val Val Val Val 
1405 1410 1415 

ctt tec ate att agt acc ttg gtt tct ggc ttg gag aac age aac gtc 4323 
Leu Ser He He Ser Thr Leu Val Ser Gly Leu Glu Asn Ser Asn Val 
1420 1425 1430 1435 

ttc ecg ccc aca etc ttc agg att gtc cgc ttg get egg ate ggt ega 4371 
Phe Pro Pro Thr Leu Phe Arg He Val Arg Leu Ala Arg He Gly Arg 
1440 1445 1450 

ate etc aga ctg gtc egg geg get ega gga ate agg aca etc ctt ttc 4419 
He Leu Arg Leu Val Arg Ala Ala Arg Gly He Arg Thr Leu Leu Phe 
1455 1460 1465 

gcg ttg atg atg tct etc ccc tct etc ttc aac att ggt ctg ctt etc 4467 
Ala Leu Met Met Ser Leu Pro Ser Leu Phe Asn He Gly Leu Leu Leu 
1470 1475 1480 

ttt ctg gtg atg ttc att tat gee ate ttt ggg atg aac tgg ttt tec 4515 
Phe Leu Val Met Phe He Tyr Ala He Phe Gly Met Asn Trp Phe Ser 
1485 1490 1495 

aaa gtg aag aga ggc tct ggg att gat gac ate ttc aac ttt gac act 4563 
Lys Val Lys Arg Gly Ser Gly He Asp Asp He Phe Asn Phe Asp Thr 
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1500 X505 1510 1515 

ttc teg ggc age atg etc tgc etc ttc eag ata ace act tea gee ggc 461X 
Phe Ser Gly Ser Met Leu Cys Leu Phe Gin He Thr Thr Ser Ala Gly 
1520 1525 1530 

tgg gat get etc etc aac ecc atg ctg gaa tea aaa gee tet tgc aat 4659 
Trp Asp Ala Leu Leu Asn Pro Met Leu Glu Ser Lys Ala Ser Cys Asn 
1535 1540 1545 

tec tec tec caa gag age tgt eag eag ccg cag ata gee ata gtc tac 4707 
Ser Ser Ser Gin Glu Ser Cys Gin Gin Pro Gin He Ala He Val Tyr 
1550 1555 1560 

ttc gtc age tac ate ate ate tec ttt etc att gtg gtt aac atg tac 4755 
Phe Val Ser Tyr He He He Ser Phe Leu He Val Val Asn Met Tyr 
1565 1570 1575 

ata get gtg att eta gag aac ttc aac aca gee aca gag gag age gag 4803 
He Ala Val He Leu Glu Asn Phe Asn Thr Ala Thr Glu Glu Ser Glu 
1580 1585 1590 1595 

gac ccc ctg ggc gaa gac gac ttt gag ate ttc tat gag ate tgg gag 4851 
Asp Pro Leu Gly Glu Asp Asp Phe Glu He Phe Tyr Glu He Trp Glu 
1600 1605 1610 

aag ttt gac ccc gaa gea aca cag ttc ate cag tac tea tec etc tct 4899 
Lys Phe Asp Pro Glu Ala Thr Gin Phe He Gin Tyr Ser Ser Leu Ser. 
1615 1620 1625 

gac ttc gee gac gee ctg eee gag ccg ttg egt gtg gee aag ecc aac 4947 
Asp Phe Ala Asp Ala Leu Pro Glu Pro Leu Arg Val Ala Lys Pro Asn 
1630 1635 1640 

agg ttt eag ttt etc atg atg gac ttg ccc atg gtg atg ggt gat cgc 4995 
Arg Phe Gin Phe Leu Met Met Asp Leu Pro Met Val Met Gly Asp Arg 
1645 1650 1655 

etc cat tgc atg gat gtt etc ttt get ttc ace acc agg gtc etc ggg 5043 
Leu His Cys Met Asp Val Leu Phe Ala Phe Thr Thr Arg Val Leu Gly 
1660 1665 1670 1675 

aac tec age ggc ttg gat acc atg aaa gee atg atg gag gag aag ttc 5091 
Asn Ser Ser Gly Leu Asp Thr Met Lys Ala Met Met Glu Glu Lys Phe 
1680 1685 1690 

atg gag gee aat ect ttc aag aag ttg tac gag ccc att gtc ace acc 5139 
Met Glu Ala Asn Pro Phe Lys Lys Leu Tyr Glu Pro He Val Thr Thr 
1695 1700 1705 

aca aag agg aag gag gag gag gaa tgt gee get gtc ate cag agg gee 5187 
Thr Lys Arg Lys Glu Glu Glu Glu Cys Ala Ala Val He Gin Arg Ala 
1710 1715 1720 

tac egg aga cae atg gag aag atg ate aag ctg aag ctg aaa ggc agg 5235 
Tyr Arg Arg His Net Glu Lys Met He Lys Leu Lys Leu Lys Gly Arg 
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1725 1730 1735 

tea agt tea teg etc eag gtg ttt tgc aat gga gac ttg tct age ttg 5283 
Ser Ser Ser Ser Leu Gin Val Phe Cys Asn Gly Asp Leu Ser Ser Leu 
1740 1745 1750 1755 

gat gtg ccc aag ate aag gtt cat tgt gac tgaaaceccc aectgcacgc 5333 
Asp Val Pro Lys lie Lys Val His Cys Asp 
1760 1765 

ctacctcaca gcetcaeagc tcagccccca gcctetggcg aacaageggc ggactcaccg 5393 

aaeaggccgt tcaacttgtt tttttgggtg aaagaggtga taggttggtg tecattttta 5453 

aatgattctt ggaaagattg aacgteggaa catgttagaa aggactgeca aggacatcca 5513 

cagtaacgga aggectgaag gaeagttcaa attatgtaaa gaaaegagaa ggaaaggtca 5573 

catgtetgtt cagttttaag tatgtgacet gceacatgta getcetttgc atgttaagtg 5633 

agaagtcaaa accctgecat aagtaaatag ctttgttgca ggtgttteta ceagtgctgc 5693 

ggatttgggt gtatggctca aacctgaaag catgactctg aettgtcagc accccaactt 5753 

tcagaagctc tgatctctgt cetaggtgtt tgaeaaataa atacataaaa naaaaaaaaa 5813 

aaaaaaaaa 5822 



<210> 5 

<211> 1765 

<212> PRT 

<213> Mus musculus 

<400> 5 

Met Glu Glu Arg Tyr Tyr Pro Val He Phe Pro Asp Glu Arg Asn Phe 
15 10 15 

Arg Pro Phe Thr Phe Asp Ser Leu Ala Ala He Glu Lys Arg He Thr 
20 25 30 

He Gin Lys Glu Lys Lys Lys Ser Lys Asp Lys Ala Ala Thr Glu Pro 
35 40 45 

Gin Pro Arg Pro Gin Leu Asp Leu Lys Ala Ser Arg Lys Leu Pro Lys 
50 55 60 

Leu Tyr Gly Asp Val Pro Pro Asp Leu He Ala Lys Pro Leu Glu Asp 
65 70 75 80 

Leu Asp Pro Phe Tyr Lys Asp His Lys Thr Phe Met Val Leu Asn Lys 
85 90 95 

Lys Arg Thr He Tyr Arg Phe Ser Ala Lys Arg Ala Leu Phe He Leu 
100 105 110 
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Qly Pro Phe Asn Pro He Arg Ser Phe Met He Arg He Ser Val His 
115 120 125 

Ser val Phe Ser Met Phe He He Cys Thr Val He He Asn Cys Met 
130 135 140 

Phe Met Ala Asn Asn Ser Ser Val Asp Ser Arg Pro Ser Ser Asn He 
145 150 155 160 

Pro Glu Tyr Val Phe He Gly He Tyr Val Leu Glu Ala Val He Lys 
165 170 175 

He Leu Ala Arg Gly Phe He Val Asp Glu Phe Ser Tyr Leu Arg Asp 
180 185 190 

Pro Trp Asn Trp Leu Asp Phe He Val He Gly Thr Ala He Ala Pro 
195 200 205 

Cys Phe Leu Gly Asn Lys Val Asn Asn Leu Ser Thr Leu Arg Thr Phe 
210 215 220 

Arg Val Leu Arg Ala Leu Lys Ala He Ser Val He Ser Gly Leu Lys 
225 230 235 240 

Val He Val Gly Ala Leu Leu Arg Ser Val Lys Lys Leu Val Asp Val 
245 250 255 

Met Val Leu Thr Leu Phe Cys Leu Ser He Phe Ala Leu Val Gly Gin 
260 265 270 

Gin Leu Phe Met Gly He Leu Ser Gin Lys Cys He Lys Asp Asp Cys 
275 280 285 

Gly Pro Asn Ala Phe Ser Asn Lys Asp Cys Phe Val Lys Glu Asn Asp 
290 295 300 

Ser Glu Asp Phe He Met Cys Gly Asn Trp Leu Gly Arg Arg Ser Cys 

305 310 315 320 

Pro Asp Gly Ser Thr Cys Asn Lys Thr Thr Phe Asn Pro Asp Tyr Aan 
325 330 335 

Tyr Thr Asn Phe Asp Ser Phe Gly Trp Ser Phe Leu Ala Met Phe Arg 
340 345 350 

Val Met Thr Gin Asp Ser Trp Glu Lys Leu Tyr Arg Gin He Leu Arg 
355 360 365 

Thr Ser Gly He Tyr Phe Val Phe Phe Phe Val Val Val He Phe Leu 
370 375 380 

Gly Ser Phe Tyr Leu Leu Asn Leu Thr Leu Ala Val Val Thr Met Ala 
385 390 395 400 



Tyr Glu Glu Gin Asn Arg Asn Val Ala Ala Glu Thr Glu Ala Lys Glu 
405 410 415 
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Lys Met Phe Gin Glu Ala Gin Gin Leu Leu Arg Glu Glu Lys Glu Ala 
420 425 430 

Leu Val Ala Met Gly He Asp Arg Thr Ser Leu Asn Ser Leu Gin Ala 
435 440 445 

Ser Ser Phe Ser Pro Lys Lys Arg Lys Phe Phe Gly Ser Lys Thr Arg 
450 455 460 

Lys Ser Phe Phe Met Arg Gly Ser Lys Thr Ala Arg Ala Ser Ala Ser 
4^5 470 475 480 

Asp Ser Glu Asp Asp Ala Ser Lys Asn Pro Gin Leu Leu Glu Gin Thr 
485 490 495 

Lys Arg Leu Ser Gin Asn Leu Pro Val Glu Leu Phe Asp Glu His Val 
500 505 510 

Asp Pro Leu His Arg Gin Arg Ala Leu Ser Ala Val Ser He Leu Thr 
515 520 525 

He Thr Met Gin Glu Gin Glu Lys Ser Gin Glu Pro Cys Phe Pro Cys 
530 535 540 

Gl^ Lys Asn Leu Ala Ser Lys Tyr Leu Val Trp Glu Cys Ser Pro Pro 
545 550 555 560 

Trp Leu Cys He Lys Lys Val Leu Gin Thr He Met Thr Asp Pro Phe 
565 570 575 

Thr Glu Leu Ala He Thr He Cys He He Val Asn Thr Val Phe Leu 
580 585 590 

Ala Met Glu His His Asn Met Asp Asn Ser Leu Lys Asp He Leu Lys 
595 600 605 

He Gly Asn Trp Val Phe Thr Gly He Phe He Ala Glu Met Cys Leu 
610 615 620 

Lys He He Ala Leu Asp Pro Tyr His Tyr Phe Arg His Gly Trp Asn 
625 630 635 640 

He Phe Asp Ser He Val Ala Leu Val Ser Leu Ala Asp Val Leu Phe 
645 650 655 

His Lys Leu Ser Lys Asn Leu Ser Phe Leu Ala Ser Leu Arg Val Leu 
660 665 670 

Arg Val Phe Lys Leu Ala Lys Ser Trp Pro Thr Leu Asn Thr Leu He 
675 680 685 

Lys He He Gly His Ser Val Gly Ala Leu Gly Asn Leu Thr Val Val 
690 695 700 



Leu Thr He Val Val Phe He Phe Ser Val Val Gly Met Arg Leu Phe 
705 710 715 720 
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Gly Ala Lys Phe Asn Lys Thr Cys Ser Thr Ser Pro Glu Ser Leu Arg 
725 730 735 

Arg Trp His Met Gly Asp Phe Tyr His Ser Phe Leu Val Val Phe Arg 
740 745 750 

lie Leu Cys Gly Glu Trp He Glu Asn Met Trp Glu Cys Met Gin Glu 
755 760 765 

Met Glu Gly Ser Pro Leu Cys Val lie Val Phe Val Leu He Met Val 
770 775 780 

val Gly Lys Leu Val Val Leu Asn Leu Phe He Ala Leu Leu Leu Asn 
'^^^ 790 795 800 

Ser Phe Ser Asn Glu Glu Lys Asp Gly Asn Pro Glu Gly Glu Thr Arg 
805 810 815 

Lys Thr Lys Val Gin Leu Ala Leu Asp Arg Phe Ser Arg Ala Phe Tyr 
820 825 830 

Phe Met Ala Arg Ala Leu Gin Asn Phe Cys Cys Lys Arg Cys Arg Arg 
835 840 845 

Gin Asn Ser Pro Lys Pro Asn Glu Ala Thr Glu Ser Phe Ala Gly Glu 
850 855 860 

Ser Arg Asp Thr Ala Thr Leu Asp Thr Arg Ser Trp Lys Glu Tyr Asp 
865 870 875 880 

Ser Glu Met Thr Leu Tyr Thr Gly Gin Ala Gly Ala Pro Leu Ala Pro 
885 890 895 

Leu Ala Lys Glu Glu Asp Asp Met Glu Cys Cys Gly Glu Cys Asp Ala 
900 905 910 

Ser Pro Thr Ser Gin Pro Ser Glu Glu Ala Gin Ala Cys Asp Leu Pro 
915 920 925 

Leu Lys Thr Lys Arg Leu Pro Ser Pro Asp Asp His Gly Val Glu Met 
530 935 940 

Glu Val Phe Ser Glu Glu Asp Pro Asn Leu Thr He Gin Ser Ala Arg 
545 950 955 960 

Lys Lys Ser Asp Ala Ala Ser Met Leu Ser Glu Cys Ser Thr He Asp 
965 970 975 

Leu Asn Asp He Phe Arg Asn Leu Gin Lys Thr Val Ser Pro Gin Lys 
980 985 990 

Gin Pro Asp Arg Cys Phe Pro Lys Gly Leu Ser Cys He Phe Leu Cys 
595 1000 1005 

Cys Lys Thr He Lys Lys Lys Ser Pro Trp Val Leu Trp Trp Asn Leu 
1010 1015 1020 
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Arg Lys Thr Cys Tyr Gin lie Val Lys His Ser Trp Phe Glu Ser Phe 
1025 1030 1035 1040 

lie He Phe val He Leu Leu Ser Ser Gly Ma Leu He Phe Glu Asp 
1045 1050 1055 

Val Asn Leu Pro Ser Arg Pro Gin Val Glu Lys Leu Leu Lys Cys Thr 
1060 1065 1070 

Asp Asn He Phe Thr Phe He Phe Leu Leu Glu Met He Leu Lys Trp 
1075 1080 1085 

Val Ala Phe Gly Phe Arg Lys Tyr Phe Thr Ser Ala Trp Cys Trp Leu 
1090 1095 1100 

Asp Phe Leu He Val Val Val Ser Val Leu Ser Leu Thr Asn Leu Pro 
1105 1110 1115 1120 

Asn Leu Lys Ser Phe Arg Asn Leu Arg Ala Leu Arg Pro Leu Arg Ala 
1125 1130 1135 

Leu Ser Gin Phe Glu Gly Met Lys Val Val Val Asn Ala Leu Met Ser 
1140 1145 1150 

Ala He Pro Ala He Leu Asn Val Leu Leu Val Cys Leu He Phe Trp 
1155 1160 1165 

Leu He Phe Cys He Leu Gly Val Asn Phe Phe Ser Gly Lys Phe Gly 
1170 * 1175 1180 

Arg Cys He Asn Gly Thr Asp He Asn Lys Tyr Phe Asn Ala Ser Asn 
1185 1190 1195 1200 

Val Pro Asn Gin Ser Gin Cys Leu Val Ser Asn Tyr Thr Trp Lys Val 
1205 1210 1215 

Pro Asn Val Asn Phe Asp Asn Val Gly Asn Ala Tyr Leu Ala Leu Leu 
1220 1225 1230 

Gin Val Ala Thr Tyr Lys Gly Trp Leu Asp He Met Asn Ala Ala Val 
1235 1240 1245 

Asp Ser Arg Gly Lys Asp Glu Gin Pro Ala Phe Glu Ala Asn Leu Tyr 
1250 1255 1260 

Ala Tyr Leu Tyr Phe Val Val Phe He He Phe Gly Ser Phe Phe Thr 
1265 1270 1275 1280 

Leu Asn Leu Phe He Gly Val He He Asp Asn Phe Asn Gin Gin Gin 
1285 1290 1295 

Lys Lys Leu Gly Gly Gin Asp He Phe Met Thr Glu Glu Gin Lys Lys 
1300 1305 1310 

Tyr Tyr Asn Ala Met Lys Lys Leu Gly Thr Lys Lys Pro Gin Lys Pro 
1315 1320 1325 
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He Pro Arg Pro Leu Asn Lys Cys Gin Ala Phe Val Phe Aap Leu Val 



"30 1335 ^3^^ 



Tbr Ser Gin Val Phe Asp Val He He Leu Gly Leu lie Val Thr Asn 



1350 3^355 



1360 



Met He He Met Met Ala Glu Ser Glu Gly Gin Pro Asn Glu Val Lys 
1365 1370 3^375 

Lys He Phe Asp He Leu Asn He Val Phe Val Val He Phe Thr Val 
"80 138S 1390 

Glu Cys^Leu He Lys Val Phe Ala Leu Arg Gin His Tyr Phe Thr 



1400 1405 



Asn 



''^^ ser He He Ser 

1420 



1410 1415 



^r^Leu val Ser Gly^^u Glu Asn Ser Asn Jal Phe Pro Pro Thr Leu 



Phe Arg He Val Arg Leu Ala Arg He Gly Arg He Leu Arg Leu 



1445 1450 



Val 



1455 



Arg Ala Ala Arg Gly He Arg Thr Leu Leu Phe Ala Leu Met Met Ser 



1460 1465 



1470 



Leu Pro ser Leu Phe Asn He Gly Leu Leu Leu Phe Leu Val Met Phe 
"75 1480 1485 

""'^ "^'^ '^'^ Arg Gly 

1495 1500 

ser Gly He Asp Asp He Phe Asn Phe Asp Thr Phe Ser Gly Ser Met 
1505 1510 ^5^5 



1520 



Leu cys Leu Phe Gin He Thr Thr Ser Ala Gly Trp Asp Ala Leu Leu 
"25 1530 ^535 

Asn Pro Met Leu Glu Ser Lys Ala Ser Cys Asn Ser Ser Ser Gin Glu 

1545 1550 

ser cys Gin Gin Pro Gin He Ala He Val Tyr Phe Val Ser Tyr He 
1555 1560 1565 

He He Ser Phe Leu He Val Val Asn Met Tyr He Ala Val He 
1570 1575 1580 



Leu 



Glu Asn Phe Asn Thr Ala Thr Glu Glu Ser Glu Asp Pro Leu Gly Glu 

■ 1595 1600 

Asp Asp Phe Glu He Phe Tyr Glu He Trp Glu Lys Phe Asp Pro Glu 
1605 1610 1615 

Ala Thr Gin Phe He Gin Tyr Ser Ser Leu Ser Asp Phe Ala Asp Ala 

1625 1630 
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Leu Pro Glu Pro Leu Arg Val Ala Lys Pro Asn Arg Phe Gin Phe Leu 
1635 1640 1645 

Met Met Asp Leu Pro Met Val Met Gly Asp Arg Leu His Cys Met Asp 
1650 1655 1660 

Val Leu Phe Ala Phe Thr Thr Arg Val Leu Gly Asn Ser Ser Gly Leu 
1665 1670 1675 1680 

Asp Thr Met Lys Ala Met Met Glu Glu Lys Phe Met Glu Ala Asn Pro 
1685 1690 1695 

Phe Lys Lys Leu Tyr Glu Pro lie Val Thr Thr Thr Lys Arg Lys Glu 
1700 1705 1710 

Glu Glu Glu Cys Ala Ala Val He Gin Arg Ala Tyr Arg Arg His Met 
1715 1720 1725 

Glu Lys Met He Lys Leu Lys Leu Lys Gly Arg Ser Ser Ser Ser Leu 
1730 1735 1740 

Gin Val Phe Cys Asn Gly Asp Leu Ser Ser Leu Asp Val Pro Lys He 
1745 1750 1755 1760 

Lys Val His Cys Asp 
1765 



<210> 6 
<211> 3701 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> <1) . . (3699) 

<223> partial human NaN cDNA sequence 
<220> 

<22l> unsure 
<222> (922) 
<223> y = c or t 

<400> 6 

tec att gtc att gga ata gcg att gtg tea tat att cca gga ate acc 48 
Ser He Val He Gly He Ala He Val Ser Tyr He Pro Gly He Thr 
15 10 15 

ate aaa eta ttg ecc ctg egt aec ttc cgt gtg ttc aga get ttg aaa 96 
He Lys Leu Leu Pro Leu Arg Thr Phe Arg Val Phe Arg Ala Leu Lys 
20 25 30 

gca att tea gta gtt tea cgt ctg aag gtc ate gtg ggg gee ttg eta 144 
Ala He Ser Val Val Ser Arg Leu Lys Val He Val Gly Ala Leu Leu 
35 40 45 
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cgc tct gtg aag aag ctg gtc aac gtg att ate etc ace ttc ttt tgc 192 
Arg Ser Val Lys Lys Leu Val Asn Val lie lie Leu Thr Phe Phe Cys 
50 55 €0 

etc age ate ttt gee etg gta ggt eag eag cte ttc atg gga agt ctg 240 
Leu Ser lie Phe Ala Leu Val 61y Gin Gin Leu Phe Met 6ly Ser Leu 
65 70 75 80 

aae ctg aaa tge ate teg agg gac tgt aaa aat ate agt aac ccg gaa 288 
Asn Leu Lys Cys lie Ser Arg Asp Cys Lys Asn He Ser Asn Pro Glu 
85 90 95 

get tat gac cat tge ttt gaa aag aaa gaa aat tea cct gaa tte aaa 336 
Ala Tyr Asp His Cys Phe Glu Lys Lys Glu Asn Ser Pro Glu Phe Lys 
100 105 110 

atg tgt gge ate tgg atg ggt aae agt gee tgt tee ata caa tat gaa 384 
Met Cys Gly He Trp Met Gly Asn Ser Ala Cys Ser He Gin Tyr Glu 
115 120 125 

tgt aag eae ace aaa att aat cct gac tat aat tat acg aat ttt gac 432 
Cys Lys His Thr Lys He Asn Pro Asp Tyr Asn Tyr Thr Asn Phe Asp 
130 135 140 

aac ttt ggc tgg tct ttt ctt gcc atg ttc egg ctg atg acc caa gat 480 
Asn Phe Gly Trp Ser Phe Leu Ala Met Phe Arg Leu Met Thr Gin Asp 
145 150 155 160 

tec tgg gag aag ctt tat eaa eag ace etg egt act act ggg cte tae 528 
Ser Trp Glu Lys Leu Tyr Gin Gin Thr Leu Arg Thr Thr Gly Leu Tyr 
165 170 175 

tea gtc tte tte tte att gtg gtc att ttc ctg ggc tec ttc tae ctg 576 
Ser Val Phe Phe Phe He Val Val He Phe Leu Gly Ser Phe Tyr Leu 
180 185 190 

att aac tta acc ctg get gtt gtt ace atg gca tat gag gag eag aac 624 
He Asn Leu Thr Leu Ala Val Val Thr Met Ala Tyr Glu Glu Gin Asn 
195 200 205 

aag aat gta get gea gag ata gag gee aag gaa aag atg ttt cag gaa 672 
Lys Asn Val Ala Ala Glu He Glu Ala Lys Glu Lys Met Phe Gin Glu 
210 215 220 

gcc cag cag ctg tta aag gag gaa aag gag get ctg gtt gee atg gga 720 
Ala Gin Gin Leu Leu Lys Glu Glu Lys Glu Ala Leu Val Ala Met Gly 
225 230 235 240 

att gac aga agt tea ctt act tec ctt gaa aca tea tat ttt acc cca 768 
He Asp Arg Ser Ser Leu Thr Ser Leu Glu Thr Ser Tyr Phe Thr Pro 
245 250 255 

aaa aag aga aag etc ttt ggt aat aag aaa agg aag tee tte ttt ttg 816 
Lys Lys Arg Lys Leu Phe Gly Asn Lys Lys Arg Lys Ser Phe Phe Leu 
260 265 270 
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aga gag tct ggg aaa gac cag cct cct ggg tea gat tct gat gaa gat 864 
Arg Glu Ser Gly Lys Asp Gin Pro Pro Gly Ser Asp Ser Asp Glu Asp 
275 280 285 

tgc caa aaa aag cca cag etc eta gag caa ace aaa cga ctg tec cag 912 
Cys Gin Lys Lys Pro Gin Leu Leu Glu Gin Thr Lys Arg Leu Ser Gin 
290 295 300 

aat eta tea ytg gac cac ttt gat gag cat gga gat cct etc caa agg 960 
Asn Leu Ser Xaa Asp His Phe Asp Glu His Gly Asp Pro Leu Gin Arg 
305 310 3X5 320 

cag aga gea ctg agt get gte age ate etc ace ate ace atg aag gaa 1008 
Gin Arg Ala Leu Ser Ala Val Ser He Leu Thr He Thr Met Lys Glu 
325 330 335 

caa gaa aaa tea caa gag cct tgt etc cct tgt gga gaa aac ctg gea 1056 
Gin Glu Lys Ser Gin Glu Pro Cys Leu Pro Cys Gly Glu Asn Leu Ala 
340 345 350 

tec aag tac etc gtg tgg aac tgt tgc ecc cag tgg ctg tgc gtt aag 1104 
Ser Lys Tyr Leu Val Trp Asn Cys Cys Pro Gin Trp Leu Cys Val Lys 
355 360 365 

aag gte ctg aga act gtg atg act gac ccg ttt act gag ctg gee ate 1152 
Lys Val Leu Arg Thr Val Met Thr Asp Pro Phe Thr Glu Leu Ala He 
370 375 380 

acc ate tgc ate ate ate aac act gte ttc ttg gcc atg gag cat cac 1200 
Thr He Cys He He He Asn Thr Val Phe Leu Ala Met Glu His His 
385 390 395 400 

aag atg gag gcc agt ttt gag aag atg ttg aat ata ggg aat ttg gtt 1248 
Lys Met Glu Ala Ser Phe Glu Lys Met Leu Asn He Gly Asn Leu Val 
405 410 415 

ttc act age att ttt ata gea gaa atg tgc eta aaa ate att geg etc 1296 
Phe Thr Ser He Phe He Ala Glu Met Cys Leu Lys He He Ala Leu 
420 425 430 

gat ecc tac cac tac ttt egc cga ggc tgg aac att ttt gac age att 1344 
Asp Pro Tyr His Tyr Phe Arg Arg Gly Trp Asn He Phe Asp Ser He 
435 440 445 

gtt get ett ctg agt ttt gea gat gta atg aac tgt gta ctt caa aag 1392 
Val Ala Leu Leu Ser Phe Ala Asp Val Met Asn Cys Val Leu Gin Lys 
450 455 460 

aga age tgg cca ttc ttg egt tec ttc aga gtg etc agg gte ttc aag 1440 
Arg Ser Trp Pro Phe Leu Arg Ser Phe Arg Val Leu Arg Val Phe Lys 
465 470 475 480 

tta gcc aaa tec tgg cca act ttg aac aca eta att aag ata ate ggc 1488 
Leu Ala Lys Ser Trp Pro Thr Leu Asn Thr Leu He Lys He He Gly 
485 490 495 
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aac tct gtc gga gcc ctt gga age ctg act gtg gtc ctg gtc att gtg 1536 
Asn Ser Val Gly Ala Leu Gly Ser Leu Thr Val Val Leu Val lie Val 
500 505 510 

ate ttt att ttc tea gta gtt ggc atg cag ett ttt ggc cgt age ttc 1584 
He Phe He Phe Ser Val Val Gly Met Gin Leu Phe Gly Arg Ser Phe 
515 520 525 

aat tec caa aag agt cca aaa etc tgt aac ccg aca ggc ccg aca gtc 1632 
Asn Ser Gin Lys Ser Pro Lys Leu Cys Asn Pro Thr Gly Pro Thr Val 
530 535 540 

tea tgt tta egg cac tgg cac atg ggg gat ttc tgg cac tec ttc eta 1680 
Ser Cys Leu Arg His Trp His Met Gly Asp Phe Trp His Ser Phe Leu 
545 550 555 560 

gtg gta tte cge ate etc tgc ggg gaa tgg ate gaa aat atg tgg gaa 1728 
Val Val Phe Arg He Leu Cys Gly Glu Trp He Glu Asn Met Trp Glu 
565 570 575 

tgt atg eaa gaa geg aat gea tea tea tea ttg tgt gtt att gtc tte 1776 
Cys Met Gin Glu Ala Asn Ala Ser Ser Ser Leu Cys Val He Val Phe 
580 585 590 

ata ttg ate acg gtg ata gga aaa ctt gtg gtg etc aac etc tte att 1824 
He Leu He Thr Val He Gly Lys Leu Val Val Leu Asn Leu Phe He 
595 600 605 

gcc tta ctg etc aat tec ttt age aat gag gaa aga aat gga aac tta 1872 
Ala Leu Leu Leu Asn Ser Phe Ser Asn Glu Glu Arg Asn Gly Asn Leu 
610 615 620 

gaa gga gag gee agg aaa act aaa gtc cag tta gea ctg gat cga ttc 1920 
Glu Gly Glu Ala Arg Lys Thr Lys Val Gin Leu Ala Leu Asp Arg Phe 
625 630 635 640 

cgc egg get ttt tgt ttt gtg aga cac act ctt gag eat tte tgt cac 1968 
Arg Arg Ala Phe Cys Phe Val Arg His Thr Leu Glu His Phe Cys His 
645 650 655 

aag tgg tgc agg aag caa aac tta cca cag caa aaia gag gtg gea gga 2016 
Lys Trp Cys Arg Lys Gin Asn Leu Pro Gin Gin Lys Glu Val Ala Gly 
660 665 670 

ggc tgt get gea caa age aaa gac ate att cec ctg gtc atg gag atg 2064 
Gly Cys Ala Ala Gin Ser Lys Asp He He Pro Leu Val Met Glu Met 
675 680 685 

aaa agg ggc tea gag ace cag gag gag ctt ggt ata eta ace tct gta 2112 
Lys Arg Gly Ser Glu Thr Gin Glu Glu Leu Gly He Leu Thr Ser Val 
690 695 700 

cca aag ace ctg ggc gtc agg eat gat tgg act tgg ttg gea cca ett 2160 
Pro Lys Thr Leu Gly Val Arg His Asp Trp Thr Trp Leu Ala Pro Leu 
705 710 715 720 
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gcg gag gag gaa gat gac gtt gaa ttt tct ggt gaa gat aat gca cag 2208 
Ala Glu Glu Glu Asp Asp Val Glu Phe Ser Gly Glu Asp Asn Ala Gin 
725 730 735 

cgc ate aca caa cct gag cct gaa caa cag gcc tat gag etc cat cag 2256 
Arg lie Thr Gin Pro Glu Pro Glu Gin Gin Ala Tyr Glu Leu His Gin 
740 745 750 

gag aac aag aag ccc acg age cag aga gtt caa agt gtg gaa att gac 2304 
Glu Asn Lys Lys Pro Thr Ser Gin Arg Val Gin Ser Val Glu lie Asp 
755 . 760 765 

atg ttc tct gaa gat gag cct cat ctg ace ata cag gat ccc cga aag 2352 
Met Phe Ser Glu Asp Glu Pro His Leu Thr lie Gin Asp Pro Arg Lys 
770 775 780 

aag tct gat gtt acc agt ata eta tea gaa tgt age aec att gat ctt 2400 
Lys Ser Asp Val Thr Ser lie Leu Ser Glu Cys Ser Thr lie Asp Leu 
785 790 795 800 

cag gat ggc ttt gga tgg tta cct gag atg gtt ccc aaa aag caa cca 2448 
Gin Asp Gly Phe Gly Trp Leu Pro Glu Met Val Pro Lys Lys Gin Pro 
805 810 815 

gag aga tgt ttg ccc aaa ggc ttt ggt tgc tgc ttt cca tgc tgt age 2496 
Glu Arg Cys Leu Pro Lys Gly Phe Gly Cys Cys Phe Pro Cys Cys Ser 
820 825 830 

gtg gac aag aga aag cct ccc tgg gtc att tgg tgg aac ctg egg aaa 2544 
Val Asp Lys Arg Lys Pro Pro Trp Val lie Trp Trp Asn Leu Arg Lys 
835 840 845 

acc tgc tae caa ata gtg aaa cac age tgg ttt gag age ttt att ate 2592 
Thr Cys Tyr Gin lie Val Lys His Ser Trp Phe Glu Ser Phe lie lie 
850 855 860 

ttt gtg att ctg ctg age agt ggg gca ctg ata ttt gaa gat gtt cac 2640 
Phe Val He Leu Leu Ser Ser Gly Ala Leu He Phe Glu Asp Val His 
865 870 875 880 

Ctt gag aac caa ccc aaa ate caa gaa tta eta aat tgt act gac att 2688 
Leu Glu Asn Gin Pro Lys He Gin Glu Leu Leu Asn Cys Thr Asp He 
885 890 895 

att ttt aca cat att ttt ate ctg gag atg gta eta aaa tgg gta gee 2736 
He Phe Thr His He Phe He Leu Glu Met Val Leu Lys Trp Val Ala 
900 905 910 

ttc gga ttt gga aag tat ttc acc agt gcc tgg tgc tgc ctt gat ttc 2784 
Phe Gly Phe Gly Lys Tyr Phe Thr Ser Ala Trp Cys Cys Leu Asp Phe 
915 920 925 

ate att gtg att gtc tct gtg acc ace etc att aac tta atg gaa ttg 2832 
He He Val He Val Ser Val Thr Thr Leu He Asn Leu Met Glu Leu 
930 935 940 
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aag tec ttc egg act eta cga gca ctg agg cct ctt cgt gcg ctg tec 2880 
Lys Ser Phe Arg Thr Leu Arg Ala Leu Arg Pro Leu Arg Ala Leu Ser 
945 950 955 960 

cag ttt gaa gga atg aag gtg gtg gtc aat get etc ata ggt gcc ata 2928 
Gin Phe Glu Gly Met Lys Val Val Val Asn Ala Leu lie Gly Ala lie 
965 970 975 

cct gcc att ctg aat gtt ttg ctt gtc tgc etc att ttc tgg etc gta 2976 
Pro Ala lie Leu Asn Val Leu Leu Val cys Leu lie Phe Trp Leu Val 
980 985 990 

ttt tgt att ctg gga gta tac ttc ttt tct gga aaa ttt ggg aaa tgc 3024 
Phe Cys lie Leu Gly Val Tyr Phe Phe Ser Gly Lys Phe Gly Lys Cys 
995 1000 1005 

att aat gga aca gac tea gtt ata aat tat ace ate att aca aat aaa 3072 
lie Asn Gly Thr Asp Ser Val lie Asn Tyr Thr lie He Thr Asn Lys 
1010 1015 1020 

agt caa tgt gaa agt ggc aat ttc tct tgg ate aac cag aaa gtc aac 3120 
Ser Gin Cys Glu Ser Gly Asn Phe Ser Trp He Asn Gin Lys Val Asn 
1025 1030 1035 1040 

ttt gac aat gtg gga aat get tac etc get ctg ctg caa gtg gca aca 3168 
Phe Asp Asn Val Gly Asn Ala Tyr Leu Ala Leu Leu Gin Val Ala Thr 
1045 1050 1055 

ttt aag ggc tgg atg gat att ata tat gca get gtt gat tec aca gag 3216 
Phe Lys Gly Trp Met Aap lie lie Tyr Ala Ala Val Asp Ser Thr Glu 
1060 1065 1070 

aaa gaa caa cag cca gag ttt gag age aat tea etc ggt tac att tac 3264 
Lys Glu Gin Gin Pro Glu Phe Glu Ser Asn Ser Leu Gly Tyr lie Tyr 
1075 1080 1085 

ttc gta gtc ttt ate ate ttt ggc tea ttc ttc act ctg aat etc ttc 3312 
Phe Val Val Phe He He Phe Gly Ser Phe Phe Thr Leu Asn Leu Phe 
1090 1095 1100 

att ggc gtt ate att gac aac ttc aac caa cag cag aaa aag tta ggt 3360 
He Gly Val He He Asp Asn Phe Asn Gin Gin Gin Lys Lys Leu Gly 

1105 1110 1115 1120 

ggc caa gac att ttt atg aca gaa gaa cag aag aaa tac tat aat gca 3408 
Gly Gin Asp He Phe Met Thr Glu Glu Gin Lys Lys Tyr Tyr Asn Ala 
1125 1130 1135 

atg aaa aaa tta gga tec aaa aaa cct caa aaa cec att cca egg cct 3456 
Met Lys Lys Leu Gly Ser Lys Lys Pro Gin Lys Pro He Pro Arg Pro 
1140 1145 1150 

ctg aac aaa tgt caa ggt etc gtg ttc gac ata gtc aca age cag ate 3504 
Leu Asn Lys Cys Gin Gly Leu Val Phe Asp He Val Thr Ser Gin He 
1155 1160 1165 
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ttt gac ate ate ate ata agt etc att ate eta aac atg att age atg 3552 
Phe Asp lie Zle il He Ser Leu He He Leu Asn Met He Ser Met 
1170 1175 1180 

atg get gaa tea tae aae caa eec aaa gee atg aaa tee ate ett gac 3600 
Met Ala Glu Ser Tyr Asn Gin Pro Lys Ala Met Lys Ser He Leu Tisp 
1185 1190 1195 1200 

cat etc aac tgg gtc ttt gtg gte ate ttt aeg tta gaa tgt etc ate 3648 
His Leu Asn Trp Val Phe Val Val He Phe Thr Leu Glu Cys Leu He 
1205 1210 1215 

aaa ate ttt get ttg agg caa tfic tae tte aee aat gge tgg aat tta 3696 
Lys He Phe Ala Leu Arg Gin Tyr Tyr Phe Thr Asn Gly Trp Asn Leu 
1220 1225 1230 

ttt ga 3701 
Phe 



<210> 7 
<211> 1233 
<212> PRT 

<213> Homo sapiens 
<400> 7 

Ser lie Val He Gly He Ala He Val Ser Tyr He Pro Gly He Thr 
15 10 15 

He Lys Leu Leu Pro Leu Arg Thr Phe Arg Val Phe Arg Ala Leu Lys 
20 25 30 

Ala He Ser Val Val Ser Arg Leu Lys Val He Val Gly Ala Leu Leu 
35 40 45 

Arg Ser Val Lys Lys Leu Val Asn Val He He Leu Thr Phe Phe Cys 
50 55 60 

Leu Ser He Phe Ala Leu Val Gly Gin Gin Leu Phe Met Gly Ser Leu 
65 70 75 80 

Asn Leu Lys Cys He Ser Arg Asp Cys Lys Asn He Ser Asn Pro Glu 
85 , 90 95 

Ala Tyr Asp His Cys Phe Glu Lys Lys Glu Asn Ser Pro Glu Phe Lys 
100 105 110 

Met Cys Gly He Trp Met Gly Asn Ser Ala Cys Ser He Gin Tyr Glu 
115 120 125 

Cys Lys His Thr Lys He Asn Pro Asp Tyr Asn Tyr Thr Asn Phe Asp 
130 135 140 

Asn Phe Gly Trp Ser Phe Leu Ala Met Phe Arg lieu Met Thr Gin Asp 
145 150 155 160 
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Ser Trp Glu Lys Leu Tyr Gin Gin Thr Leu Arg Thr Thr Gly Leu Tyr 
165 170 175 

Ser Val Phe Phe Phe lie Val Val He Phe Leu Gly Ser Phe Tyr Leu 
180 185 190 

He Asn Leu Thr Leu Ala Val Val Thr Met Ala Tyr Glu Glu Gin Asn 
195 200 205 

Lys Asn Val Ala Ala Glu He Glu Ala Lys Glu Lys Met Phe Gin Glu 
210 215 220 

Ala Gin Gin Leu Leu Lys Glu Glu Lys Glu Ala Leu Val Ala Met Gly 
225 230 235 240 

He Asp Arg Ser Ser Leu Thr Ser Leu Glu Thr Ser Tyr Phe Thr Pro 
245 250 255 

Lys Lys Arg Lys Leu Phe Gly Asn Lys Lys Arg Lys Ser Phe Phe Leu 
260 265 270 

Arg Glu Ser Gly Lys Asp Gin Pro Pro Gly Ser Asp Ser Asp Glu Asp 
275 280 285 

Cys Gin Lys Lys Pro Gin Leu Leu Glu Gin Thr Lys Arg Leu Ser Gin 
290 295 300 

Asn Leu Ser Xaa Asp His Phe Asp Glu His Gly Asp Pro Leu Gin Arg 
305 310 315 320 

Gin Arg Ala Leu Ser Ala Val Ser He Leu Thr He Thr Met Lys Glu 
325 330 335 

Gin Glu Lys Ser Gin Glu Pro Cys Leu Pro Cys Gly Glu Asn Leu Ala 
340 345 350 

Ser Lys Tyr Leu Val Trp Asn Cys Cys Pro Gin Trp Leu Cys Val Lys 
355 360 365 

Lys Val Leu Arg Thr Val Met Thr Asp Pro Phe Thr Glii Leu Ala He 
370 375 380 

Thr He Cys He He He Asn Thr Val Phe Leu Ala Met Glu His His 
385 390 395 400 

Lys Met Glu Ala Ser Phe Glu Lys Met Leu Asn He Gly Asn Leu Val 
405 410 415 

Phe Thr Ser He Phe He Ala Glu Met Cys Leu Lys He He Ala Leu 
420 425 430 

Asp Pro Tyr His Tyr Phe Arg Arg Gly Trp Asn He Phe Asp Ser He 
435 440 445 



Val Ala Leu Leu Ser Phe Ala Asp Val Met Asn Cys Val Leu Gin Lys 
450 455 460 
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Arg Ser Trp Pro Phe Leu Arg Ser Phe Arg Val Leu Arg Val Phe Lys 
465 470 475 480 

Leu Ala Lys Ser Trp Pro Thr Leu Asn Thr Leu He Lys He He Gly 
485 490 495 

Asn Ser Val Gly Ala Leu Gly Ser Leu Thr Val Val Leu Val He Val 
500 505 510 

He Phe He Phe Ser Val Val Gly Met Gin Leu Phe Gly Arg Ser Phe 
515 520 525 

Asn Ser Gin Lys Ser Pro Lys Leu Cys Asn Pro Thr Gly Pro Thr Val 
530 535 540 

Ser Cys Leu Arg His Trp His Met Gly Asp Phe Trp His Ser Phe Leu 
545 550 555 560 

Val Val Phe Arg He Leu Cys Gly Glu Trp He Glu Asn Met Trp Qlu 
565 570 575 

Cys Met Gin Glu Ala Asn Ala Ser Ser Ser Leu Cys Val He Val Phe 

580 585 590 

He Leu He Thr Val He Gly Lys Leu Val Val Leu Asn Leu Phe He 
595 600 605 

Ala Leu Leu Leu Asn Ser Phe Ser Asn Glu Glu Arg Asn Gly Asn Leu 
610 615 620 

Glu Gly Glu Ala Arg Lys Thr Lys Val Gin Leu Ala Leu Asp Arg Phe 
625 630 635 640 

Arg Arg Ala Phe Cys Phe Val Arg His Thr Leu Glu His Phe Cys His 
645 650 655 

Lys Trp Cys Arg Lys Gin Asn Leu Pro Gin Gin Lys Glu Val Ala Gly 
660 665 670 

Gly Cys Ala Ala Gin Ser Lys Asp He He Pro Leu Val Met Glu Met 
675 680 685 

Lys Arg Gly Ser Glu Thr Gin Glu Glu Leu Gly He Leu Thr Ser Val 
690 695 700 

Pro Lys Thr Leu Gly Val Arg His Asp Trp Thr Trp Leu Ala Pro Leu 
705 710 715 720 

Ala Glu Glu Glu Asp Asp Val Glu Phe Ser Gly Glu Asp Asn Ala Gin 

725 730 735 

Arg He Thr Gin Pro Glu Pro Glu Gin Gin Ala Tyr Glu Leu His Gin 
740 745 750 

Glu Asn Lys Lys Pro Thr Ser Gin Arg Val Gin Ser Val Glu He Asp 
755 760 765 
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Met Phe Ser Glu Asp Glu Pro His Leu Thr He Gin Asp Pro Arg Lys 
770 775 780 

Lys Ser Asp Val Thr Ser He Leu Ser Glu Cys Ser Thr He Asp Leu 
785 790 795 800 

Gin Asp Gly Phe Gly Trp Leu Pro Glu Met Val Pro Lys Lys Gin Pro 
805 8X0 815 

Glu Arg Cys Leu Pro Lys Gly Phe Gly Cys Cys Phe Pro Cys Cys Ser 
820 825 830 

Val Asp Lys Arg Lys Pro Pro Trp Val He Trp Trp Asn Leu Arg Lys 
835 840 845 

Thr Cys Tyr Gin He Val Lys His Ser Trp Phe Glu Ser Phe He He 
850 855 860 

Phe Val He Leu Leu Ser Ser Gly Ala Leu He Phe Glu Asp Val His 
865 870 875 880 

Leu Glu Asn Gin Pro Lys He Gin Glu Leu Leu Asn Cys Thr Asp He 
885 890 895 

He Phe Thr His He Phe He Leu Glu Met Val Leu Lys Trp Val Ala 
900 905 910 

Phe Gly Phe Gly Lys Tyr Phe Thr Ser Ala Trp Cys Cys Leu Asp Phe 
915 920 925 

He He Val He Val Ser Val Thr Thr Leu He Asn Leu Met Glu Leu 
930 935 940 

Lys Ser Phe Arg Thr Leu Arg Ala Leu Arg Pro Leu Arg Ala Leu Ser 
945 950 955 960 

Gin Phe Glu Gly Met Lys Val Val Val Asn Ala Leu He Gly Ala He 
965 970 975 

Pro Ala He Leu Asn Val Leu Leu Val Cys Leu He Phe Trp Leu Val 

980 985 990 

Phe Cys He Leu Gly Val Tyr Phe Phe Ser Gly Lys Phe Gly Lys Cys 
995 1000 1005 

He Asn Gly Thr Asp Ser Val He Asn Tyr Thr He He Thr Asn Lys 
10X0 X015 1020 

Ser Gin Cys Glu Ser Gly Asn Phe Ser Trp He Asn Gin Lys Val Asn 
X025 1030 1035 1040 

Phe Asp Asn Val Gly Asn Ala Tyr Leu Ala Leu Leu Gin Val Ala Thr 
X045 1050 1055 

Phe Lys Gly Trp Met Asp He He Tyr Ala Ala Val Asp Ser Thr Glu 
1060 1065 1070 
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Lys Glu Gin Gin Pro Glu Phe Glu Ser Asn Ser Leu Gly Tyr lie Tyr 
1075 1080 1085 

Phe Val Val Phe lie He Phe Gly Ser Phe Phe Thr Leu Asn Leu Phe 
1090 1095 1100 

He Gly Val He He Asp Asn Phe Asn Gin Gin Gin Lys Lys Leu Gly 
1105 1110 1115 1120 

Gly Gin Asp He Phe Met Thr Glu Glu Gin Lys Lys Tyr Tyr Asn Ala 
1125 1130 1135 

Met Lys Lys Leu Gly Ser Lys Lys Pro Gin Lys Pro He Pro Arg Pro 
1140 1145 1150 

Leu Asn Lys Cys Gin Gly Leu Val Phe Asp He Val Thr Ser Gin He 
1155 1160 1165 

Phe Asp He He He He Ser Leu He He Leu Asn Met He Ser Met 
1170 1175 1180 

Met Ala Glu Ser Tyr Asn Gin Pro Lys Ala Met Lys Ser He Leu Asp 
1185 1190 1195 1200 

His Leu Asn Trp Val Phe Val Val He Phe Thr Leu Glu Cys Leu He 
1205 1210 1215 

Lys He Phe Ala Leu Arg Gin Tyr Tyr Phe Thr Asn Gly Trp Asn Leu 
1220 1225 1230 

Phe 



<210> 8 

<211> 1243 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> partial human NaN amino acid seq. 
<400> 8 

Ser He Val He Gly He Ala He Val Ser Tyr He Pro Gly He Thr 
15 10 15 

He Lys Leu Leu Pro Leu Arg Thr Phe Arg Val Phe Arg Ala Leu Lys 
20 25 30 

Ala He Ser Val Val Ser Arg Leu Lys Val lie Val Gly Ala Leu Leu 
35 40 45 

Arg Ser Val Lys Lys Leu Val Asn Val He He Leu Thr Phe Phe Cys 
50 55 60 



46 



wo 01/05831 PCT/USOO/19342 



Leu Ser lie Phe Ala Leu Val Gly Gin Gin Leu Phe Met Gly Ser Leu 
65 70 75 80 

Asn Leu Lys Cys He Ser Arg Asp Cys Lys Asn He Ser Asn Pro Glu 
85 90 95 

Ala Tyr Asp His Cys Phe Glu Lys Lys Glu Asn Ser Pro Glu Phe Ly? 
100 105 110 

Met Cys Gly He Trp Met Gly Asn Ser Ala Cys Ser He Gin Tyr Glu 
115 120 125 

Cys Lys His Thr Lys He Asn Pro Asp Tyr Asn Tyr Thr Asn Phe Asp 
X30 135 140 

Asn Phe Gly Trp Ser Phe Leu Ala Met Phe Arg Leu Met Thr Gin Asp 
145 150 155 160 

Ser Trp Glu Lys Leu Tyr Gin Gin Thr Leu Arg Thr Thr Gly Leu Tyr 
165 170 175 

Ser Val Phe Phe Phe He Val Val He Phe Leu Gly Ser Phe Tyr Leu 
180 185 190 

He Asn Leu Thr Leu Ala Val Val Thr Met Ala Tyr Glu Glu Gin Asn 
195 200 205 

Lys Asn Val Ala Ala Glu He Glu Ala Lys Glu Lys Met Phe Gin Glu 
210 215 220 

Ala Gin Gin Leu Leu Lys Glu Glu Lys Glu Ala Leu Val Ala Met Gly 
225 230 235 240 

He Asp Arg Ser Ser Leu Thr Ser Leu Glu Thr Ser Tyr Phe Thr Pro 
245 250 255 

Lys Lys Arg Lys Leu Phe Gly Asn Lys Lys Arg Lys Ser Phe Phe Leu 
260 265 270 

Arg Glu Ser Gly Lys Asp Gin Pro Pro Gly Ser Asp Ser Asp Glu Asp 
275 280 285 

Cys Gin Lys Lys Pro Gin Leu Leu Glu Gin Thr Lys Arg Leu Ser Gin 
290 295 300 

Asn Leu Ser Leu Asp His Phe Asp Glu His Gly Asp Pro Leu Gin Arg 
305 310 315 320 

Gin Arg Ala Leu Ser Ala Val Ser He Leu Thr He Thr Met Lys Glu 
325 330 335 

Gin Glu Lys Ser Gin Glu Pro Cys Leu Pro Cys Gly Glu Asn Leu Ala 
340 345 350 



Ser Lys Tyr Leu Val Trp Asn Cys Cys Pro Gin Trp Leu Cys Val Lys 
355 360 365 
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Lys Val Leu Arg Thr Val Met Thr Asp Pro Phe Thr Glu Leu Ala He 
370 375 380 

Thr He Cys He He He Asn Thr Val Phe Leu Ala Met Glu His His 
385 390 395 400 

Lys Met Glu Ala Ser Phe Glu Lys Met Leu Asn He Gly Asn Leu Val 
405 4X0 415 

Phe Thr Ser He Phe He Ala Glu Met Cys Leu Lys He He Ala Leu 
420 425 430 

Asp Pro Tyr His Tyr Phe Arg Arg Gly Trp Asn He Phe Asp Ser He 
435 440 445 

Val Ala Leu Leu Ser Phe Ala Asp Val Met Asn Cys Val Leu Gin Lys 
450 455 460 

Arg Ser Trp Pro Phe Leu Arg Ser Phe Arg Val Leu Arg Val Phe Lys 
465 470 475 480 

Leu Ala Lys Ser Trp Pro Thr Leu Asn Thr Leu He Lys He He Gly 
485 490 495 

Asn Ser Val Gly Ala Leu Gly Ser Leu Thr Val Val Leu Val He Val 
500 505 510 

He Phe He Phe Ser Val Val Gly Met Gin Leu Phe Gly Arg Ser Phe 
515 520 525 

Asn Ser Gin Lys Ser Pro Lys Leu Cys Asn Pro Thr Gly Pro Thr Val 
530 535 540 

Ser Cys Leu Arg His Trp His Met Gly Asp Phe Trp His Ser Phe Leu 
545 550 555 560 

Val Val Phe Arg He Leu Cys Gly Glu Trp He Glu Asn Met Trp Glu 
565 570 575 

Cys Met Gin Glu Ala Asn Ala Ser Ser Ser Leu Cys Val He Val Phe 
580 585 590 

He Leu He Thr Val He Gly Lys Leu Val Val Leu Asn Leu Phe He 
595 600 605 

Ala Leu Leu Leu Asn Ser Phe Ser Asn Glu Glu Arg Asn Gly Asn Leu 
610 615 620 

Glu Gly Glu Ala Arg Lys Thr Lys Val Gin Leu Ala Leu Asp Arg Phe 
625 630 635 640 

Arg Arg Ala Phe Cys Phe Val Arg His Thr Leu Glu His Phe Cys His 
645 650 655 



Lys Trp Cys Arg Lys Gin Asn Leu Pro Gin Gin Lys Glu Val Ala Gly 
660 665 670 
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Gly Cys Ala Ala Gin Ser Lys Asp He He Pro Leu Val Met Glu Met 
675 680 685 

Lys Arg Gly Ser Glu Thr Gin Glu. Glu Leu Gly He Leu Thr Ser Val 
690 695 700 

Pro Lys Thr Leu Gly Val Arg His Asp Trp Thr Trp Leu Ala Pro Leu 
705 710 715 720 

Ala Glu Glu Glu Asp Asp Val Glu Phe Ser Gly Glu Asp Asn Ala Gin 
725 730 735 

Arg He Thr Gin Pro Glu Pro Glu Gin Gin Ala Tyr Glu Leu His Gin 
740 745 750 

Glu Asn Lys Lys Pro Thr Ser Gin Arg Val Gin Ser Val Glu He Asp 
755 760 765 

Met Phe Ser Glu Asp Glu Pro His Leu Thr He Gin Asp Pro Arg Lys 
770 775 780 

Lys Ser Asp Val Thr Ser He Leu Ser Glu Cys Ser Thr He Asp Leu 
785 790 795 800 

Gin Asp Gly Phe Gly Trp Leu Pro Glu Met Val Pro Lys Lys Gin Pro 
805 810 815 

Glu Arg Cys Leu Pro Lys Gly Phe Gly Cys Cys Phe Pro Cys Cys Ser 
820 825 830 

Val Asp Lys Arg Lys Pro Pro Trp Val He Trp Trp Asn Leu Arg Lys 
835 840 845 

Thr Cys Tyr Gin He Val Lys His Ser Trp Phe Glu Ser Phe He He 
850 855 860 

Phe Val He Leu Leu Ser Ser Gly Ala Leu He Phe Glu Asp Val His 
865 870 875 880 

Leu Glu Asn Gin Pro Lys He Gin Glu Leu Leu Asn Cys Thr Asp He 
885 890 895 

He Phe Thr His He Phe He Leu Glu Met Val Leu Lys Trp Val Ala 
900 905 910 

Phe Gly Phe Gly Lys Tyr Phe Thr Ser Ala Trp Cys Cys Leu Asp Phe 
915 920 925 

He He Val He Val Ser Val Thr Thr Leu He Asn Leu Met Glu Leu 
930 935 940 

Lys Ser Phe Arg Thr Leu Arg Ala Leu Arg Pro Leu Arg Ala Leu Ser 
945 950 955 960 



Gin Phe Glu Gly Met Lys Val Val Val Asn Ala Leu He Gly Ala He 
965 970 975 
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Pro Ala lie Leu Asn Val Leu Leu Val Cys Leu lie Phe Trp Leu Val 
980 985 990 

Phe Cys He Leu Gly Val Tyr Phe Phe Ser Gly Lys Phe Gly Lys Cys 
995 1000 1005 

He Asn Gly Thr Asp Ser Val He Asn Tyr Thr He He Thr Asn Lys 
1010 1015 1020 

Ser Gin Cys Glu Ser Gly Asn Phe Ser Trp He Asn Gin Lys Val Asn 
1025 1030 1035 1040 

Phe Asp Asn Val Gly Asn Ala Tyr Leu Ala Leu Leu Gin Val Ala Thr 
1045 1050 1055 

Phe Lys Gly Trp Met Asp He He Tyr Ala Ala Val Asp Ser Thr Glu 
1060 1065 1070 

Lys Glu Gin Gin Pro Glu Phe Glu Ser Asn Ser Leu Gly Tyr He Tyr 
1075 1080 1085 

Phe Val Val Phe He He Phe Gly Ser Phe Phe Thr Leu Asn Leu Phe 
1090 1095 1100 

He Gly Val He He Asp Asn Phe Asn Gin Gin Gin Lys Lys Leu Gly 
1105 1110 1115 1120 

Gly Gin Asp He Phe Met Thr Glu Glu Gin Lys Lys Tyr Tyr Asn Ala 
1125 1130 1135 

Met Lys Lys Leu Gly Ser Lys Lys Pro Gin Lys Pro He Pro Arg Pro 
1140 1145 1150 

Leu Asn Lys Cys Gin Gly Leu Val Phe Asp He Val Thr Ser Gin He 
1155 1160 1165 

Phe Asp He He He He Ser Leu He He Leu Asn Met He Ser Met 
1170 1175 1180 

Met Ala Glu Ser Tyr Asn Gin Pro Lys Ala Met Lys Ser He Leu Asp 
1185 1190 1195 1200 

His Leu Asn Trp Val Phe Val Val He Phe Thr Leu Glu Cys Leu He 
1205 1210 1215 

Lys He Phe Ala Leu Arg Gin Tyr Tyr Phe Thr Asn Gly Trp Asn Leu 
1220 1225 1230 

Phe Asp Cys Val Val Val Leu Leu Ser He Val 
1235 1240 



<210> 9 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<22l> variation 

<222> (6) 

<223> r a a or g 

<220> 

<223> Description of Artificial Sequence: rat NaN 
forward primer no. 1 

<400> 9 

gacccrtgga attggttgga 



<210> 10 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: rat NaN 
forward primer no. 2 

<400> 10 

aatccctgga attggttgga 



<210> 11 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: rat NaN 
forward primer no. 3 

<400> 11 

gacccgtgga actggttaga 



<210> 12 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: rat NaN 
forward primer no. 4 

<400> 12 

gatctttgga actggcttga 



<210> 13 
<2ll> 21 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: rat Nan 
forward primer no. 5 

<400> 13 

aacatagtgc tggagttcag g 



<210> 14 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: rat NaN 
forward primer no. 6 

<400> 14 

gtggcctttg gattccggag g 



<210> 15 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: rat NaN 
reverse primer no. 1 

<400> 15 

caagaiaggcc cagctgaagg tgtc 



<210> 16 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: rat NaN 
reverse primer no. 2 

<400> 16 

gaggaatgcc. cacgcaaagg aatc 



<210> 17 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: rat NaN 
reverse primer no. 3 
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<400> 17 

aagaagggac cagccaaagt tgtc 



24 



<210> 18 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: rat NaN 

reverse primer no. 4, y = c or t, r = a or g, n « 
a or c or g or t, w = a or t 

<400> 18 

acytccatrc ouiwcccacat 20 



<210> 19 
<211> 20 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: rat NaN 
reverse primer no. 5, r = a or g 



<210> 20 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: rat NcLN 
reverse primer no. 6 

<400> 20 

tctgctgccg agccaggta 19 



<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: rat NaN 
reverse primer no. 7 



<400> 19 

agraartcna gccarcacca 



20 



<400> 21 

ctgagataac tgaaatcgcc 



20 
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<210> 22 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer, 
Marathon AP-l 

<400> 22 

ccatcctaat acgactcact atagggc 27 



<210> 23 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer, 
Marathon AP-2 



<210> 24 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mouse NaN 
forward primer 

<400> 24 

ccctgctgcg ctcggtgaag aag 23 

<210> 25 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mouse NaN 
reverse primer 



<400> 23 

actcactata gggctcgagc ggc 



23 



<400> 25 

gacaaagtag atcccagagg 



20 



<210> 26 
<211> 17 
<212> DNA 



<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: human NaN 
forward primer 

<400> 26 

ctcagtagtt ggcatgc 



<210> 27 
<211> 24 
<2X2> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: humeui NaN 
reverse primer 

<400> 27 

ggaaagaagc acgaccacac agtc 



<210> 28 
<211> 94 
<212> PRT 

<213> Rattus norvegicus 
<220> 

<223> C-terminal tnmcated rat NaN 
<400> 28 

Ala Ala Gly Gin Ala Met Arg Lys Gin Gly Asp He Leu Qly Pro Asn 
15 10 15 

He His Gin Phe Ser Gin Ser Ser Glu Thr Pro Phe Leu Gly Cys Pro 
20 25 30 

Gin Gin Arg Thr Cys Val Ser Phe Val Arg Pro Gin Arg Val Leu Arg 
35 40 45 

Val Pro Trp Phe Pro Ala Trp Arg Thr Val Thr Phe Leu Ser Arg Pro 
50 55 60 

Arg Ser Ser Glu Ser Ser Ala Trp Leu Gly Leu Val Glu Ser Ser Gly 
65 70 75 80 

Trp Ser Gly Leu Pro Gly Glu Ser Gly Pro Ser Ser Leu Leu 
85 90 



<210> 29 

<211> 211 

<212> DNA 

<213> Mus musculus 

<400> 29 

agtttaatgt tgagtgaatt gtggtggtga tttcccactt gaggcctttg tgttaaagcc 60 
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caatgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtggtt 120 
ggggggtggt ggcagagtct ggtattggta aggtgagagc aatcccagaa cgtccacctg 180 
ctcttccatt ttattaatca ggcaggcctc t 211 



<210> 30 

<211> 242 

<212> DNA 

<213> Mus mus cuius 



<400> 30 

gtaagccact ggctcttaac taaaatgctc 
ggtggtggtg gtggtggtgg tggtggtggt 
ggtggaggtg gnggtggagg tggtggctgt 
ggangtggtg gcgtggtggt ggnggtggtg 

gc 



gttggcatta gaacatttct gagctggggt 60 
ggtggtggtg gtggtggtgg tgatggtggt 120 
ggtggtggng gtggtggtgg tggtggangt 180 
gtggaggtgg tggctgtggt ggtngtggtg 240 

242 



<210> 31 

<211> 200 

<212> DNA 

<213> Mus musculus 



<400> 31 

tgtgcatgct tgattcccag ctcctatggt 
gactgtccac cctgacacac acacacacac 
acagtgtaga gaattacctc attcttggag 
agttcacaga gcaacagctg 



<210> 32 

<211> 181 

<212> DNA 

<213> Mus musculus 



ctgattactc ggtccttagg agcaaggcca 60 
acacacacac acacacacac acacacacac 120 
tttctctgga aaaggaatgt ctcaaagcca 180 

200 



<400> 32 

tgttagaaac tctaagacaa tgaagcacca 
catgcattac ccactgcttg tgctttcacc 
ctctctctct ctctctctct ctctctctct 
t 



tgctggaaat aagagcacaa actctttctt 60 
ttagtgctcg tgctctctct ttctctctct 12 0 

ctctctctct CtCtCtcttt tttttttttt 180 

181 



<210> 33 

<211> 128 

<212> DNA 

<213> Mus musculus 



<400> 33 

cacacacaca cacacacaca cacacacaca cacacacaca gagaaacact gtcgcagtca 60 
tacatataaa gataaataca tcttaaaaaa agaaccatgt gattgagtta taaaatattc 120 
caacttat ^28 



<210> 34 
<211> 200 
<212> DNA 
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<213> Mus musculus 



<400> 34 

aggtcatttc ctctgcagtg tgcttggcag 
cctgcttgat catccatgta tcacacacac 
tggggaagaa ggaagaggtt aagcacatag 
acacctgatc attcttccat 



gaaaaacttc ctggctattc aagtcagtgc 60 
acaaaacaaa caaacaaaca aacaaaaccc 120 
gcagagagca gccaggctga ctcagagcaa 180 

200 



<210> 35 

<211> 158 

<212> DNA 

<213> Mus musculus 

<400> 35 

gtgctgggat caaaggcgtg cgccgccacc acgcccggcc cctttttatg tttcaaattt 60 
acttttatca tgtgcacgtg tgtgggtgcg tgcatgtgtg tgcgtgcgtg tgcgtgtgng 120 
tgtgngtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtg 158 



<210> 36 

<211> 113 

<212> DNA 

<213> Mus musculus 

<400> 36 

cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca cacacacttg 60 
catctttgag ttaattggat aggctgagtc ttacaccgga atcatactgt tgc 113 



<210> 37 

<211> 200 

<212> DNA 

<213> Mus musculus 

<400> 37 

ccaatgagag actcttgtct caaaaaagcc 
aagaatgtgc tctggcctga aaacacacac 
agttttattt atttatttaa aaaaatatgt 
gattaagtca accagagcca 



atggtgtcca gatcctgagg aataacacct 60 
acacacacac acacacacac acacacacac 120 
ctctaggcat tgctgaaatg tctcctacag 180 

200 



<210> 38 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: protein seq. 
basis for rat NaN reverse primers 

<220> 

<221> VARIANT 
<222> (3) 

<223> Xaa = Val or Asp 
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FIG. I A 

Nucleolide sequence of rat NaN. Translation initiation begins 

at position 41 (ATG). Reading frame ends at position 5336 (TGA). 

1 ACGGTGCCCT GATCCTCTGT ACCAGGAAGA CAGGGTGAAG ATGGAGGAGA 

51 GGTACTACCC GGTGATCn^C CCGGACGAGC GGAATTTCCG CCCCTTCACT 

101 TCCGACTCTC TGGCTGCCAT AGAGAAGCGG ATTGCTATCC AAAAGGAGAG 

151 GAAGAAGTCC AAAGACAAGG CGGCAGCTGA GCCCCAGCCT CGGCCTCAGC 

201 TTGACCTAAA GGCCTCCAGG AAGTTACCTA AGCTTTATGG TGACATTCCC 

251 CCTGAGCTTG TAGCGAAGCC TCTGGAAGAC CTGGACCCAT TCTACAAAGA 

301 CCATAAGACA TTCATGGTGT TGAACAAGAA GAGAACAATT TATCGCTTCA 

351 GCGCCAAGCG GGCCTTGTTC ATTCTGGGGC CTTTTAATCC CCTCAGAAGC 

401 TTAATGATTC GTATCTCTGT CCATTCAGTC TTTAGCATGT TCATCATCTG 

451 CACGGTGATC ATCAACTGTA TGTTCATGGC GAATTCTATG GAGAGAAGTT 

501 TCGACAACGA CATTCCCGAA TACGTCTTCA TTGGGATTTA TATTTTAGAA 

551 GCTGTGATTA AAATATTGGC AAGAGGCTTC ATTGTGGATG AGTTTTCCTT 

601 CCTCCGAGAT CCGTGGAACT GGCTGGACTT CATTGTCATT GGAACAGCGA 

651 TCGCAACTTG TTTTCCGGGC AGCCAAGTCA ATCTTTCAGC TCTTCGTACC 

701 TTCCGAGTGT TCAGAGCTCT GAAGGCGATT TCAGTTATCT * CAGGTCTGAA 

751 GGTCATCGTA GGTGCCCTGC TGCGCTCGGT GAAGAAGCTG GTAGACGTGA 

801 TGGTCCTCAC TCTCTTCTGC CTCAGCATCT TTGCCCTGGT CGGTCAGCAG 

851 CTGTTCATGG GAATTCTGAA CCAGAAGTGT ATTAAGCACA ACTGTGGCCC 

901 CAACCCTGCA TCCAACAAGG ATTGTTTTGA AAAGGAAAAA GATAGCGAAG 

951 ACTTCATAAT GTGTGGTACC TGGCTCGGCA GCAGACCCTG TCCCAATGGT 

1001 TCTACGTGCG ATAAAACCAC ATTGAACCCA GACAATAATT ATACAAAGTT 

1051 TGACAACTTT GGCTGGTCCT TTCTCGCCAT GTTCCGGGTT ATGACTCAAG 

1101 ACTCCTGGGA GAGGCTTTAC CGACAGATCC TGCGGACCTC TGGGATCTAC 

1151 TTTGTCTTCT TCTTCGTGGT GGTCATCTTC CTGGGCTCCT TCTACCTGCT 

1201 TAACCTAACC CTGGCTGTTG TCACCATGGC TTATGAAGAA CAGAACAGAA 

1251 ATGTAGCTGC TGAGACAGAG GCCAAGGAGA AAATGTTTCA GGAAGCCCAG 

1301 CAGCTGTTAA GGGAGGAGAA GGAGGCTCTG GTTGCCATGG GAATTGACAG 

1351 AAGTTCCCTT AATTCCCTTC AAGCTTCATC CTTTTCCCCG AAGAAGAGGA 

1401 AGTTTTTCGG TAGTAAGACA AGAAAGTCCT TCTTTATGAG AGGGTCCAAG 

1451 ACGGCCCAAG CCTCAGCGTC TGATTCAGAG GACGATGCCT CTAAAAATCC 

1501 ACAGCTCCTT GAGCAGACCA AACGACTGTC CCAGAACTTG CCAGTGGATC 

1551 TCTTTGATGA GCACGTGGAC CCCCTCCACA GGCAGAGAGC GCTGAGCGCT 

1601 GTCAGTATCT TAACCATCAC CATGCAGGAA CAAGAAAAAT TCCAGGAGCC 

1651 TTGTTTCCCA TGTGGGAAAA ATTTGGCCTC TAAGTACCTG GTGTGGGACT 
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1701 


GTAGCCCTCA 


ATGGCTGTGC 


ATAAAGAAGG 


TCCTGCGGAC 


CATCATGACG 


1751 


GATCCCTTTA 


CTGAGCTGGC 


CATCACCATC 


TGCATCATCA 


TCAATACCGT 


1801 


TTTCTTAGCC 


GTGGAGCACC 


ACAACATGGA 


TGACAACTTA 


AAGACCATAC 


1851 


TGAAAATAGG 


AAACTGGGTT 


TTCACGGGAA 


TTTTCATAGC 


GGAAATGTGT 


1901 


CTCAAGATCA 


TCGCGCTCGA 


CCCTTACCAC 


TACTTCCGGC 


ACGGCTGGAA 


1951 


TGTTTTTGAC 


AGCATCGTGG 


CCCTCCTGAG 


TCTCGCTGAT 


GTGCTNTACA 


2001 


ACACACTGTC 


TGATAACAAT 


AGGTCTTTCT 


TGGCTTCCCT 


CAGAGTGCTG 


2051 


AGGGTCTTCA 


AGTTAGCCAA 


ATCCTGGCCC 


ACGTTAAACA 


CTCTCATTAA 


2101 


GATCATCGGC 


CACTCCGTGG 


GCGCGCTTGG 


AAACCTGACT 


GTGGTCCTGA 


2151 


CTATCGTGGT 


CTTCATCTTT 


TCTGTGGTGG 


GCATGCGGCT 


CTTCGGCACC 


2201 


AAGTTTAACA 


AGACCGCCTA 


CGCCACCCAG 


GAGCGGCCCA 


GGCGGCGCTG 


2251 


GCACATGGAT 


AATTTCTACC 


ACTCCTTCCT 


GGTGGTGTTC 


CGCATCCTCT 


2301 


GTGGGGAATG 


GATCGAGAAC 


ATGTGGGGCT 


GCATGCAGGA 


TATGGACGGC 


2351 


TCCCCGTTGT 


GCATCATTGT 


CTTTGTCCTG 


ATAATGGTGA 


TCGGGAAGCT 


2401 


TGTGGTGCTT 


AACCTCTTCA 


TTGCCTTGCT 


GCTCAATTCC 


TTCAGCAATG 


2451 


AGGAGAAGGA 


TGGGAGCCTG 


GAAGGAGAGA 


CCAGGAAAAC 


CAAAGTGCAG 


2501 


CTAGCCCTGG 


ATCGGTTCCG 


CCGGGCCTTC 


TCCTTCATGC 


TGCACGCTCT 


2551 


TCAGAGTTTT 


TGTTGCAAGA 


AATGCAGGAG 


GAAAAACTCG 


CCAAAGCCAA 


2601 


AAGAGACAAC 


AGAAAGCTTT 


GCTGGTGAGA 


ATAAAGACTC 


AATCCTCCCG 


2651 


GATGCGAGGC 


CCTGGAAGGA 


GTATGATACA 


GACATGGCTT 


TGTACACTGG 


2701 


ACAGGCCGGG 


GCTCCGCTGG 


CCCCACTCGC 


AGAGGTAGAG 


GACGATGTGG 


2751 


AATATTGTGG 


TGAAGGCGGT 


GCCCTACCCA 


CCTCACAACA 


TAGTGCTGGA 


2801 


GTTCAGGCCG 


GTGACCTCCC 


TCCAGAGACC 


AAGCAGCTCA 


CTAGCCCGGA 


2851 


TGACCAAGGG 


GTTGAAATGG 


AAGTATTTTC 


TGAAGAAGAT 


CTGCATTTAA 


2901 


GCATACAGAG 


TCCTCGAAAG 


AAGTCTGACG 


CAGTGAGCAT 


GCTCTCGGAA 


2951 


TCCAGCACAA 


TTGACCTGAA 


TGATATCTTT 


AGAAATTTAC 


AGAAAACAGT 


3001 


TTCCCCCAAA 


AAGCAGCCAG 


ATAGATGCTT 


TCCCAAGGGC 


CTTAGTTGTC 


3051 


ACTTTCTATG 


CCACAAAACA 


GACAAGAGAA 


AGTCCCCCTG 


GGTCCTGTGG 


3101 


TGGAACATTC 


GGAAAACCTG 


CTACCAAATC 


GTGAAGCACA 


GCTGGTTTGA 


3151 


GAGTTTCATA 


ATCTTTGTTA 


TTCTGCTGAG 


CAGTGGAGCG 


CTGATATTTG 


3201 


AAGATGTCAA 


TCTCCCCAGC 


CGGCCCCAAG 


TTGAGAAATT 


ACTAAGGTGT 


3251 


ACCGATAATA 


TTTTCACATT 


TATTTTCCTC 


CTGGAAATGA 


TCCTGAAGTG 


3301 


GGTGGCCTTT 


GGATTCCGGA 


GGTATTTCAC 


CAGTGCCTGG 


TGCTGGCTTG 


3351 


ATTTCCTCAT 


TGTGGTGGTG 


TCTGTGCTCA 


GTCTCATGAA 


TCTACCAAGC 


3401 


TTGAAGTCCT 


TCCGGACTCT 


GCGGGCCCTG 


AGACCTCTGC 


GGGCGCTGTC 


3451 


CCAGTTTGAA 


GGAATGAAGG 


TTGTCGTCTA 


CGCCCTGATC 


AGCGCCATAC 


3501 


CTGCCATTCT 


CAATGTCTTG 


CTGGTCTGCC 


TCATTTTCTG 


GCTCGTATTT 


3551 


TGTATCTTGG 


GAGTAAATTT 


ATTTTCTGGG 


AAGTTTGGAA 


GGTGCATTAA 


3601 


CGGGACAGAC 


ATAAATATGT 


ATTTGGATTT 


TACCGAAGTT 


CCGAACCGAA 
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3651 


GCCAATGTAA 


CATTAGTAAT 


TACTCGTGGA 


AGGTCCCGCA 


GGTCAACTTT 


3701 


GACAACGTGG 


GGAATGCCTA 


TCTCGCCCTG 


CTGCAAGTGG 


CAACCTATAA 


3751 


GGGCTGGCTG 


GAAATCATGA 


ATGCTGCTGT 


CGATTCCAGA 


GAGAAAGACG 


3801 


AGCAGCCGGA 


CTTTGAGGCG 


AACCTCTACG 


CGTATCTCTA 


CTTTGTGGTT 


3851 


TTTATCATCT 


TCGGCTCCTT 


CrrTACCCTG 


AACCTCTTTA 


TCGGTGTTAT 


3901 


TATTGACAAC 


TTCAATCAGC 


AGCAGAAAAA 


GTTAGGTGGC 


CAAGACATTT 


3951 


TTATGACAGA 


AGAACAGAAG 


AAATATTACA 


ATGCAATGAA 


AAAGTTAGGA 


4001 


ACCAAGAAAC 


CTCAAAAGCC 


CATCCCAAGG 


CCCCTGAACA 


ANTGTCAAGC 


4051 


CTTTGTGTTC 


GACCTGGTCA 


CAAGCCATGT 


CTTTGACGTC 


ATCATTCTGG 


4101 


GTCTTATTGT 


CTTAAATATG 


ATTATCATGA 


TGGCTGAATC 


TGCCGACCAG 


4151 


CCCAAAGATG 


TGAAGAAAAC 


CTTTGATATC 


CTCAACATAG 


CCTTCGTGGT 


4201 


CATCTTTACC 


ATAGAGTGTC 


TCATCAAAGT 


CTTTGCTTTG 


AGGCAACACT 


4251 


ACTTCACCAA 


TGGCTGGAAC 


TTATTTGATT 


GTGTGGTCGT 


GGTTCTTTCT 


4301 


ATCATTAGTA 


CCCTGGTTTC 


CCGCTTGGAG 


GACAGTGACA 


TTTCTTTCCC 


4351 


GCCCACGCTC 


TTCAGAGTCG 


TCCGCTTGGC 


TCGGATTGGT 


CGAATCCTCA 


4401 


GGCTGGTCCG 


GGCTGCCCGG 


GGAATCAGGA 


CCCTCCTCTT 


TGCTTTGATG 


4451 


ATGTCTCTCC 


CCTCTCTCTT 


CAACATCGGT 


CTGCTGCTCT 


TCCTGGTGAT 


4501 


GTTCATTTAC 


GCCATCTTTG 


GGATGAGCTG 


GTTTTCCAAA 


GTGAAGAAGG 


4551 


GCTCCGGGAT 


CGACGACATC 


TTCAACTTCG 


AGACCTTTAC 


GGGCAGCATG 


4601 


CTGTGCCTCT 


TCCAGATAAC 


CACTTCGGCT 


GGCTGGGATA 


CCCTCCTCAA 


4651 


CCCCATGCTG 


GAGGCAAAAG 


AACACTGCAA 


CTCCTCCTCC 


CAAGACAGCT 


4701 


GTCAGCAGCC 


GCAGATAGCC 


GTCGTCTACT 


TCGTCAGTTA 


CATCATCATC 


4751 


TCCTTCCTCA 


TCGTGGTCAA 


CATGTACATC 


GCTGTGATCC 


TCGAGAACTT 


4801 


CAACACAGCC 


ACGGAGGAGA 


GCGAGGACCC 


TCTGGGAGAG 


GACGACTTTG 


4851 


AAATCTTCTA 


TGAGGTCTGG 


GAGAAGTTTG 


ACCCCGAGGC 


GTCGCAGTTC 


4901 


ATCCAGTATT 


CGGCCCTCTC 


TGACTTTGCG 


GACGCCCTGC 


CGGAGCCGTT 


4951 


GCGTGTGGCC 


AAGCCGAATA 


AGTTTCAGTT 


TCTAGTGATG 


GACTTGCCCA 


5001 


TGGTGATGGG 


GGACCGCCTC 


CATTGCATGG 


ATGTTCTCTT 


TGCTTTCACT 


5051 


ACCAGGGTCC 


TCGGGGACTC 


CAGCGGCTTG 


GATACCATGA 


AAACCATGAT 


5101 


GGAGGAGAAG 


TTTATGGAGG 


CCAACCCTTT 


TAAGAAGCTC 


TACGAGCCCA 


5151 


TAGTCACCAC 


CACCAAGAGG 


AAGGAGGAGG 


AGCAAGGCGC 


CGCCGTCATC 


5201 


CAGAGGGCCT 


ACCGGAAACA 


CATGGAGAAG 


ATGGTCAAAC 


TGAGGCTGAA 


5251 


GGACAGGTCA 


AGTTCATCGC 


ACCAGGTGTT 


TTGCAATGGA 


GACTTGTCCA 


5301 


GCTTGGATGT 


GGCCAAGGTC 


AAGGTTCACA 


ATGACTGAAC 


CCTCATCTCC 


5351 


ACCCCTACCT 


CACTGCCTCA 


CAGCTTAGCC 


TCCAGCCTCT 


GGCGAGCAGG 


5401 


CGGCAGACTC 


ACTGAACACA 


GGCCGTTCGA 


TCTGTGTTTT 


TGGCTGAACG 


5451 


AGGTGACAGG 


TTGGCGTCCA 


TTTTTAAATG 


ACTCTTGGAA 


AGATTTCATG 


5501 


TAGAGAGATG 


TTAGAAGGGA 


CTGCAAAGGA 


CACCGACCAT 


AACGGAAGGC 


5551 


CTGGAGGACA 


GTCCAACTTA 


CATAAAGATG 


AGAAACAAGA 


AGGAAAGATC 
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5601 CCAGGAAAAC TTCAGATTGT 

5651 CGGTGTTTTG AGTATGTGAC 

5701 GTCAAAACCC TGCAGTAAGT 

5751 ATCACAGAAT TGGGTGTATG 

5801 TCAGTCAGCA CCCCGACTTT 

5851 AACGAATAAA TAGGTAAAAG 



GTTCTCAGTA CATTCCCCAA TGTGTCTGTT 
CTGCCACATG TAGCTCTTTT TTGCATGTAC 
TAATAGCTTG CTACGGGTGT TCCTACCAGC 
ACTCAAACCT AAAAGCATGA CTCTGACTTG 
CAGACGCTCC AATCTCTGTC CCAGGTGTCT 
AAAAA 
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FIG. 2 A 

Predicted amino acid sequence of ral NaN (1765 a.a). 

1 MEERYYPVIF PDERNFRPFT SDSLAAIEKR lAIQKERKKS KDKAAAEPQP 

51 RPQLDLKASR KLPKLYGDIP PELVAKPLED LDPFYKDHKT FMVLNKKRTI 

DI-Sl 

Ifll YRFSAKRALF ILGPFNPLRS LMIR ISVHSV FSMFIICTVI INCMPMAM .qM 

DI-S2 DI-S3 
151 ERSFD NDIPE YVFIGIYILB AVIKILA RGF IVDEFSFLRD PWNWLDPIVT 

DI-S4 

201 GTAIATCFPG SOVNL SALRT FRVFRALKAI SVISGLKV IV GALLRSVKKL 
DI~S5 

251 VDVMVLTLFC LSTFAL VGOO LFM GILMQKC IKHNCGPNPA SNKDCFEKEK 

DI-SSl 

301 DSEDFIMCGT WLGSRPCPNG STCDKTTLNP DNNYTKF DNF GWSFLAMFRV 

DI-SS2 DI-S6 
351 MT ODSWE RLY RQILRTSGIY FVFFFVWIF LGSFYLLNLT LAWTMAY ER 

401 QNRNVAAETE AKEKMFQEAQ QLLREEKEAL VAMGIDRSSL NSLQASSFSP 

451 KKRKFFGSKT RKSFFMRGSK TAQASASDSE DDASKNPQLL EQTKRLSQNL 

501 PVDLFDEHVD PLHRQRALSA VSILTITMQE QEKFQEPCFP CGKNLASKYL 

DII-Sl 

551 VWDCSPQWLC IKKVL RTTMT DPFTRr.AITI CITINTVFLA V EHMNMPDMr, 

DII-S2 DII-S3 
601 KTT.LKTGN WV FTGIFIAEMC LKIIA LDPYH YF RHGW^fVFD SIVALLSLAD 

DII-S4 

651 VLYlJTLSDhIN RS FLASLRVL RVFKLAKSWP Tr. MTr.TKTTr: HSVGALGNLT 

DII-S5 ^ DII-SSl 

701 WT..TTWFIF SVVGMRLFGT KFNKTAYATQ ERPRRRWHMD NFYHSFLWF 

DII-SS2 DII-S6 
751 RILCSEWIEN MWGCMQDMDG SP LCIIVFVL IMVTnKr.WL NLFIALLLNS 
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801 ESNEEKDGSL EGETRKTKVQ LALDRPRRAF SFMLHALQSF CCKKCRRKNS 
851 PKPKETTESF AGENKDSILP DARPWKEYDT DMALYTGQAG APLAPLAEVE 

901 DDVEYCGEGG ALPTSQUSAG VQAGDLPPET KQLTSPDDQG VEMEVFSEED 

951 LHLSIQSPRK KSDAVSMLSE CSTIDLNDIF RNLQKTVSPK KQPDRCFPKG 

DIII-Sl 

1001 LSCHFLCHKT DKRKSPWVLW WNIRKTCYQ I VKHSWFESFI IFVILLSSGA 

DIII-S2 

1051 LIFEDVNLPS RPOVEK LLRC TDNIFTFIFL LEMILKWVAF GF RRYFT5AW 

DIII-S3 DIII-S4 
1101 CWLDFLIWV SVLSLM NL PS LKSFRTLRAL RPLRALSOFE GMKVWYALI 

DIII-S5 

1151 SAIPAIL NVL LVCLTFWLVF CII^VNLFSG KFGRCINGTD INMYLDFTEV 

DIII-SSl DIII-SS2 
1201 PNRSQCNISN YSWKVPQVNF D NVGNAYLAL LOV A TYKGWL E IMNAAVDSR 

DIII-S6 

1251 EKDEQPDFEA N LYAYLYFW FIIFGSFFTL NLFIGVII DN FNQQQKKLGG 

DIV-Sl 

1301 QDIFMTEEQK KYYNAMKKLG TKKPQKPIPR PLNRCQAFVF DLVTSHVFDV 

1351 IILGLIV LNM IIMMAESADQ PKDVKKTFDI LNIAFWIFT lECLIKVFAL 

DIV-S4 

1401 RQHYFTNGWN LFDCWWLS IISTLVSRLE DSDISFPP TL FRWRLARIG 

DIV-S5 

1451 PXLRr-^VRAftR qiRTLLFALM MSLPSLFNIG LLLFLVMFTV AIFGM5WF5K 

DIV-SSl DIV-SS2 
1501 VKKGSGIDDI F NFETFTGSM LCLFOITTSA GWDT LLNPML EAKEHCNSSS 

DIV-S6 

1551 QDSCQOP OIA WYFVSYIII SFLIWNMYI AVTL ENFNTA TEESEDPLGE 
1601 DDFEIPYEVW EKFDPEASQF IQYSALSDFA DALPEPLRVA KPNKFQFLVM 



1651 
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FIG. 2C 

1701 YEPIVTTTKR KEEEQGAAVI QRAYRKHMEK MVKLRLKDRS SSSHQVFCNG 
1751 DLSSLDVAKV KVHND* 
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FIG. 7A-I 

Sequence of th mouse N&K cDNA. 



I 


TCTGAGCCAA 


GGGTGAAGAX 


SGAGGAGAGG 


TACTATCCAG 


TGATCTTCCC 


AGACGAGAGG 


61 


AATTTCCGCC 


CCTTCACTTT 


CGACTCTTTG 


GCTGCAATAG 


AGAAGCGGAT 


CACCATCCAA 


121 


AAGGAGAAOA 


AGAAATCCAA 


AGACAAGGCA 


GCAACTGAGC 


CCCAGCCTCG 


GCCTCAGCTC 


181 


GACCTAAAGO 


CCTCCAGGAA 


GTTACCTAAG 


CTCTATGGCG 


ACGTTCCCCC 


TGACCTTATA 


241 


GCGAAGCCCC 


TGGAAGATCT 


GGACCCATTT 


TACAAAGACC 


ATAAGACATT 


CATGGTATTG 


301 


AACAAGAAGX 


GAACAATCTA 


TCGCTTCAGC 


GCCAAGAGGG 


CCTTGTTCAT 


TCTGGGGCCT 


361 


TTTAATCCCA 


TCAGAAGCTT 


CATGATTCGC 


ATCTCTGTCC 


ATTCAGTCTT 


CAXJCATGTTC 


421 


ATTATCTGCA 


CAGTGATCAT 


CAACTGTATG 


TTCATGGCTA 


ATAATTCTTC 


TGTGGACAGT 


481 


CGTCCTAGCA 


GTAACATTCC 


CGAATACGTC 


TTCATTGGQA 


TTTATGTTrr 


AOAAGCTGTG 


S41 


ATTAAAATAT 


TQOCAAGAiGG 


CTTCATTGTG 


GATGAflTTTT 


CCTACCTCCG 


AGATCCTTGa 


601 


AACTGGCTGG 


ACTTCATTaT 


CATCGGAACA 


GCXIATAGCGC 


CTTGTTTTCT 


CGGTAAGAAA 


661 


GTCAATAATC 


TTTCCACTCT 


ACGTACCTTC 


CGAflTGTTOA 


OAflCTCTOAA 


AGCCATTTCT 


721 


GTAATCTCAO 


GTCTGAAOGT 


CATCX3TGGGT 


GCCCTGCTGC 


GCTCCOTGAA 


GAAGCTAGTa 


781 


GACGTGATGa 


TCCTCACTCT 


CTTTTGCCTC 


AGCATCTTTG 


CCCTGGTTGG 


TCAGCAGCTG 


841 


TTCATGOGAA 


TTCTGAGCCA 


GAAATGTATT 


AAGGACGACT 


G1X3GCCCTAA 


CGCTTTTTCC 


901 


AACAAGOATT 


GCTTTGTAAA 


AGAAAATGAT 


AGCGAGGACT 


TCATAATGTG 


TGGCAACTGG 


961 


CTCGGCAGAA 


GATCCTGCCC 


CGATGGTTCC 


ACGTGCAATA 


AAACCACATT 


TAACCCAOAT 


1021 


TATAATTATA 


CAAACTTTGA 


CAOCTTTGGC 


TGGTCTTTTC 


TCGCCATGTT 


CCGOGTTATa 


1081 


ACTCAAGACT 


CCTGGGAGAA 


GCTTTATCGA 


CAGATCCTTC 


GCACCTCCGG 


GATCTACTTT 


1141 


GTCTTCTTCT 


TCGTGGTCGT 


CATCTTCCTG 


GGCTCTTTCT 


ACCTGCTTAA 


CTTAACCCra 


1201 


GCTGTCaTCA 


CCATGGCTTA 


CGAGGAACAG 


AACAGAAATO 


TCGCTGCCGA 


GACAGAGGCC 


1261 


AAGGAGAAGA 


TGTTTCAGGA 


AGCCCAGCAG 


CTGTrOAGGa 


AGGAAAAGGA 


GGCTCTGGTT 


1321 


GCCATGGGAA 


TTGACAGAAC 


TTCCCTTAAT 


TCCCTCCAAG 


CTTCGTCCTT 


TTCCCCAAAG 


1381 


AAGAGGAAGT 


rnTTGGCAfl 


TAAGACAAGA 


AAGTCCTTCT 


TTATGAGAGG 


GTCCAAGACA 


1441 


GCCCGAOCCT 


CAGGQTCCGA 


TTCAaAGGAC 


GATGCCTCTA 


AAAACCCACA 


ACTCCTTGAO 


ISOl 


CAAACAAAAC 


GACTATCCCA 


GAACTTGCCC 


GTAGAACTCT 


TTGATGAGCA 


CGTGGACCCC 


1S61 


CTCCATAOGC 


AGAflAGCGCT 


GAGTGCCGTC 


AGTATCTTAA 


CCATCACCAT 


GCAGG AACAA 


1621 


GAAAAATCCC 


AGGAaCCTTG 


TTTCCCGTGT 


GGGAAAAACT 


TGGCATCCAA 


GTACCTGGTa 


1681 


TGGGAATGTA 


GCCCTCCGTG 


GCTGTGCATA 


AAGAAGGTCC 


TGCAGACTAT 


CATGACAGAC 


1741 


CCCTTCACTG 


AGCTGGCCAT 


CACCATCTGC 


ATCATCGTCA 


ATACTGTCTT 


CTTGGCCATa 


1801 


GAACACCACA 


ATATGGATAA 


CTCTTTAAAA 


GACATACTGA 


AAATAGGAAA 


CTGGGTTTTC 


1861 


ACTGGAATTT 


TCATAGCGGA 


AATGTGTCTC 


AAGATCATTG 


CGCTAOACCC 


TTACCACTAC 


1921 


TTCCGGCACQ 


GCTGGAACAT 


CTTTGACAGC 


ATTGTGGCCC 


TTGTGAOTCT 


CGCTGACGTG 
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ISai CTCTTCCACA AACTGTCTAA AAACCTCTCC TTCTTGGCTT CCCTCAGAGT GCTGAGGGTC 

2041 TTCAAGTTAQ CCAAATCCTG GCCCACATTA AACACTCTCA TTAAGATCAT CGGCCACTCC 

2101 GTGGGTGCGC TCGGAAACCT GACTGTGGTC CTAACGATCG TGGTCTTCAT CTTTTCCGTG 

2161 GTTGGCATGC GGCTCTTTGG TGCCAAGTTT AACAAGACTT GCTCCACCTC TCCGGAGTCC 

2221 CTCCGGCGCT GGCACATGGG TGATTTCTAC CATTCCTTCC TGGTGGTGTT CCGCATCCTC 

2281 TGTGGGGAGT GGATCGAGAA CATGTGGGAA TGCATGCAGG AGATGGAAGG CTCCCCGCTG 

2341 TGTaTCATCG TCTTTGTGCT GATCATGGTG GTCGGGAAGC TCGTGGTGCT TAACCTCTTC 

2401 ATTGCCTTGC TGCTCAATTC CTTCAOCAAT GAGGAAAAGG ATGGGAACCC AGAAGGAGAO 

2461 ACCAGGAAAA CCAAAGTGCA GCTAGCCCTG GATCGGTTCA GCCGAGCGTT CTACTTCATG 

2S21 GCGCGCGCTC TTCAGAATTT CTGTTGCAAG AGATGCAGGA GGCAAAACTC GCCAAAGCCA 

2S81 AATGAGGCAA CAGAAAGCTT TGCTGGTGAG AGTAGAGACA CAGCCACCCT GGATACAAGG 

2641 TCCTGGAAOa AOTATGATTC AGAAATGACT CTGTACACTXI GGCAQGCCGG GOCTCCACTG 

2701 GCCCCACTGG CAAAAGAAGA GGACGATATG GAATGTTGTG QTGAATGTaA TGCCTCACCT 

2761 ACCTCACAGC CTAaTGAOaA AGCTCAGGCC TGTGACCTCC CTCTOAAGAC CAAGCGGCTC 

2821 CCCAGCCCAG ATGACCACGG GQTTGAAATG GAAGTGTTTT CCGAAGAAflA TCCX3AATTTA 

2881 ACCATACAflA GTGCTCGAAA GAAGTCTOAT GCGGCAAiGCA TGCTCTCAGA ATGCAGCACA 

2941 ATAOACCraA ATGATATCTT TAGAAATTTA CAGAAAACAO TTTCCCCCCA AAAOCAACCA 

3001 GATCGATQCT TTCCCAAGOG CCTCAOTTGT ATCTTTCTAT OTXGCAAAAC AATCAAAAAA 

3061 AAGTCCCCCT GGGTCCTGTG OTGOAATCTT CGGAAAACCT OCTACCAAAT CGTOAAGCAT 

3121 AGCTGGTTTa AOAGCTTCAT AATTTTTGTC ATCCTGCTGA GCAOCGGAOC ACTGATATTC 

3181 GAAGATQTCA ATCTTCCCAG CCGGCCCCAA QTTGAAAAAT TACTGAAOTO TACCGATAAT 

3241 ATTTTCACAT TTATTTTrCT CCTGOAAATG ATTTTGAAGT GGGTGGCCTT TGGATTCCGG 

3301 AAGTATTTCA CCAQTGCCTG GTGCTGGCTC GATTTCCTCA TTGTGGTGGT GTCTGTGCTC 

3361 AGCCTCACGA ACTTACCAAA CTTGAAGTCC TTCCGGAATC TQCGAGCGCT GAGACCTCTG 

3421 CGGGCACTGT CTCAaXTTGA AGGAATGAAG GTTGTTGTCA ATGCCCTCAT GAGTGCCATA 

3481 CCTGCCATCC TCAATGTCTT GCTGGTCTGC CTCATTTTCT GGCTCATATT TTGTATCCTG 

3541 GGAGTAAATT TTTTTTCTGG GAAGTTTGGA AGATGCATTA ATGGAACAGA CATAAATAAA 

3601 TATTTCAACG CTTCCAATQT TCCAAACCAA AGCCAATOTT TAGTTAGTAA TTACACGTGG 

3661 AAAGTCCCGA ATGTCAACTT TGACAACGTG GGGAATGCCT ACCTTGCCCT GCTGCAAGTQ 

3721 GCGACCTATA AflGGCTGaCT GGACATTATG AATGCAGCTG TTGATTCCAG AGGGAAAGAT 

3781 GAGCAGCCQG CCTTTGAGGC GAATCTATAC GCATACCTTT ACTTCGTGGT TTTTATCATC 

3 841 TTCGGCTCAT TCTXTACCCT GAACCTCTTT ATCGaXGTTA TTATTGACAA CTTCAATCAG 

3 901 CAGCAGAAAA AGTTAGGTGG CCAAGACATT TTTATQACAO AAGAACAGAA GAAATATTAC 

3 961 AATGCAATGA AAAAGTTAGG AACCAAGAAG CCTCAAAAGC CCATCCCAAfl GCCCCTGAAC 
4021 AAATGTCAAO CCTTCGTGTT CG ATTTGGTC ACAAGCCAGG TCTTTGACGT CATCATTCTO 
4081 GGTCTTATTG TCACAAACAT GATTATCATG ATGGCTGAAT CTGAAGGCCA GCCCAACGAA 
4141 GTGAAQAAAA TCTTTGATAT TCTCAACATA GTCTTCGTGO TCATCTTTAC CGTAGAGTGT 

4 201 CTCATCAAAG TCTTTGCTTT GAGGCAACAC TACTTCACCA ATGGCTGGAA CTTATTTGAT 
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FIG. 7A'3 



4261 


TGTGTGGTCG 


TGGTTCTTTC 


CATCATTAGT 


ACCTTGGTTT 


CTGGCTTGGA 


GAACAGCAAC 


4321 


GTCTTCCCGC 


CCACACTCTT 


CAGGATTGTC 


CGCTTGGCTC 


GGATCGGTCG 


AATCCTCAGA 


43ai 


CTGGTCCGGG 


CGGCTCGAGG 


AATCAGGACA 


CTCCTTTTCG 


CGTTGATGAT 


GTCTCTCCCC 


4441 


TCTCTCTTCA 


ACATTGGTCT 


GCTTCTCTTT 


CTGGTGATGT 


TCATTTATGC 


CATCTTTGGG 


4S01 


ATGAACTGOT 


TTTCCAAAGT 


GAAGAGAGGC 


TCTGGGATTG 


ATGACATCTT 


CAACTTTGAC 


4S61 


ACTTTCTCGG 


GCAGCATGCT 


CTGCCTCTTC 


CAGATAACCA 


CTTCAGCCGG 


CTGGGATGCT 


4621 


CTCCTCAACC 


CCATGCTGGA 


ATCAAAAGCC 


TCTTGCAATT 


CCTCCTCCCA 


AGAGAGCTGT 


4681 


CAGCAGCCGC 


AQATAGCCAT 


AGTCTACTTC 


GTCAGCTACA 


TCATCATCTC 


CTTTCTCATT 


4741 


GTGGTTAACA 


TGTACATAGC 


TGTGATTCTA 


GAGAACTTCA 


ACACAQCCAC 


AGAGGAGAGC 


4801 


GAGQACCCCC 


TGGGCGAAGA 


CGACTTTGAG 


ATCTTCTATa 


AGATCTGGGA 


GAAGTTTGAC 


4861 


CCCGAAGCAA 


CACAGTTCAT 


CCAGTACTCA 


TCCCTCTCTO 


ACTTCGCCGA 


CGCCCTGCCC 


4921 


GAflCCGTTGC 


GTGTGGCCAA 


GCCCAACAGG 


TTTCAGTTTC 


TCATOATGGA 


CTTGCCCATG 


4981 


GTGATGGGTG 


ATCGCCTCCA 


TTGCATGOAT 


GrrCTCTITG 


CTTTCACCAC 


CAGGGTCCTC 


S041 


GGGAACTCCA 


GCGGCTTGGA 


TACCATGAAA 


GCCATOATGG 


AGGAGAAQTT 


CATGGAGGCC 


5101 


AATCCTTTCA 


AGAAGTTGTA 


CGAOCCCATT 


GTCACCACCA 


CAAAGAGGAA 


GGAGGAGGAG 


5161 


aAATOTOCCa 


CTOTCATCCA 


GAGGGCCTAC 


CQGAiQACACA 


TGOAaAAiOAt 


GATGAAGCTG 


5221 


AAXSCTQAAAO 


GCAGOTCAAG 


TTCATCGCTC 


CAOOTOTTTT 


GCAATGQAGA 


CTTGTCTAOC- 


5281 


TTGOATQTGC 


GCAAGATCAA 


GGTTCATTGT 


GACTOAAACC 


CCCACCroCA 


CGCCTACCTC 


5341 


ACAflCCTCAC 


AGCTCAfiCCC 


CCAGCCTCXa 


GCGAACAAiOC 


GaCQGACTCA 


CCGAACAOGC 


5401 


CGTTCAACTT 


crmTrrrGG 


GTOAAAGAGG 


TOATAGGrra 


OTQTCCATTT 


TTAAATGATT 


5461 


CTTGGAAAGA 


rroAAcaTCG 


GAACATOTTA 


GAAAGOACra 


CCAAGGACAT 


CCACAGTAAC 


5521 


GGAAGGCCTG 


AAGGACAGTT 


CAAATTATGT 


AAAGAAACOA 


GAAGGAAAGO 


TCACATGTCT 


5581 


GTTCAGTTTT 


AAGTATGTGA 


CCTGCCACAT 


GTAGCTCCTT 


TGCATGTTAA 


GTGAGAAGTC 


5641 


AAAACCCTGC 


CATAAQTAAA 


TAGCTTTGTT 


GCAGGTGTTT 


CTACCAGTOC 


TGCGGATTTa 


5701 


GGTGTATGGC 


TCAAACCTGA 


AAGCATGACT 


CTGACTTGTC 


AGCACCCCAA 


CTTTCAGAAa 


5761 


CTCTOATCTC 


TGTCCTAGGT 


GTTTGACAAA 


XAAATACATA 


AAANAAAAAA 


AAAAAAAAAA 


5821 


AA 
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FIG. 7B-I 

Protein sequence of mt/aN 

Molecular Weight 201451.00 Daltona 
1765 Amino Acids 

198 Strongly Basic (h-) Amino Acids (K,R) 
177 Strongly Acidic {-) Amino Acids (D,E) 
712 Hydrophobic Amino Acids {A, I,L,F, W, V) 
453 Polar Amino Acids (M,C,Q, S,T, Y) 

8.260 Isolectric Point 
22.540 Charge at PH 7.0 

I KEERYYPVIF PDSRHFRPFT FDSLAAIRKR ITtQICBiaaCS KDfCMTBPQP RPQLOLKASR 

21 KLPiCLYQDVP PDLIAKPLED LDPFYKDHKT FMVLMKKRTI YRFSAKRAIiF IliOPFHPIRS 
121 FMIRISVHSY FSHFIICTYI THCHFHAWH3 svdsRPSSMT pgYVPTfiTW leavticttap 

OI-Sl DZ-S2 
181 QFIVDBFSYL RDPWHWLHFT VTOTATAPCP LGWKWWLST LRTFRVT,BAT. KATSVTSnMC 

DI-S3 DI-S4 
241 VIVOALliRSV KKLVDV MYr^T LFCLSTFALV CQQLFM QILS QKCIKDOCQP KAFSMKDCFV 

0I-S5 

3 01 KENDSEDFIM CGNWLGRRSC PDGSTCNKTT FMPDYNYTMF DSPGWSPriAM FHVM TOf)<;wq 

DI-SSl DI-SS2 
3^1 KLVRQILRTS QIY FVFFFW VTPLGSFYT.L WT.TTJIVVTMA Y EEgmMVAA ETEAKEKMFQ 

DI-S< 

421 EAQQLLRSBK EALVJUIQIDR TStHSLQASS FSPKKRKFFO SKTRKSFFKR GSKTARASAS 
481 DSGODASKKP QLLBQTKRLS QKLPVELFD8 HVDPLHKQRA LSAVSILTIT MOEQEKSQEP 
541 CFPCaKMWlS KYLWBCSPP MLCriaCVL QT TMTDPFTErA rTrcriVMTV FLAM EHHMMD 

DII-Sl 

601 MaLKDTLKTfl ffl^VFTOTFTA EMrT.ICTTAI.D PYHYFRHfiWN IFPSIVALVa IiAnVIiFHKLS 
DII-S2 0II-S3 

661 fgm.SFT.AST.p VT.BVFKi.Ays wPTtiTTLiKi IGHSVGALGM LTWIiTIWF rFSWfiHRLF 
0II-S4 Dir-ss 

721 GAKFNKTCST SPESLRRWHM GD FYHSFLW F RILCGEWTE NHWECMQEMB flSPLrVTVFV 

DZI-SSl Dir-SS2 
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FIG. 7B-2 

781 LrMWr.ta.W LmVFTALLLH SFSNEEKDGN PEGETRKTKV QLALDRFSRA FYFMARALQM 
Dir-S6 

841 FCCKRCRRQM SPKPMEATES FAGESRDTAT LDTRSWKEYD SEMTLYTGQA GAPLAPLAKE 

901 EDDMECCGEC DASPTSQPSE EAQACDLPLK TKRLPSPDOH GVEMEVFSEE DPKLTIQSAR 

361 KKSDAASMLS ECSTIDLNDI FRMLQKTVSP QKQPDRCFPK GLSCIFLCCK TIKKKSPWVL 

1021 WWMLRICrCYQ TVKWgWFESF TrFVTLLSSn ALTFEDVMT.P SRPQVE KLLK CTPMTFTFTF 

DIII-Sl DIII-S2 

1081 LtiEHILKWVA FqFRKYFTSA HCWLDFLIVY VSVLSI.TWLP MLicsFPMT.giL lrpt.ralsqf 

DIII-S2 0III-S3 DXII-S4 

1141 EOMKWVNAli MSATPft^tLM V LLVCLIFWTiT FCTLGVUFFS G KFQRCIMQT OIKKyPNASH 

DIII-S5 

1201 VPMQSQCLVS NYTWKVPMVN FDHVCiNAYLA LLQVAT]agIH--LDIMHAAVDS RGKDEQPAFB 

DIII-SSl DIII-SS2 
1261 AWL YftYT.YFV VPYTFGSFFT LmiFTGVTI D KFKQQQKKLa OQDUlfrBBQ KKYYWAMKKL 
DIII-S6 

1321 GTKKPQKFIP RPLiOCCQAFV FDriVTSOVm VIIMIilVnr HTTHHAgSBq QPNBVKXIFD 

OIV-Sl 

1381 TT,WTVFWT P T^rT.TKVFA LRQHYFTHGW HLFDCWWL SIiaTLVaQti BMSMVPPPH, 

DIV-Sa DIV-93 
1441 FRTVRTJVRTq RTT.PT.VRAAR QTRT LLPALM MSLPSLPWTfl IJiTiniVHPTY ATFGMHWFSK 

DIV-S4 DIV-SS 
ISOl VKROSaiDDI FN FDTFSGSM T.CT.FQIT T SA GWDAL LITPML BSKASCafSSS QBSCQQPQIA 

DIV-SSl DIV-SS2 
ISfil TVYFVSYTTT SFLTWNMYT AVTL EKFHTA TEESBDPLGB DDFBIFYBIW EKFDPBATQF 

DIV-S6 

1621 IQYSSLSDPA DALPEPLRVA KPHRFQFLMM DLPMVMGDRL HCMDVLFAPT TRVLGMSSGL 
1681 DTMKAMMBEK FMEAMPFKKL YBPIVTTTKR KEEEBCAAVI QRAYRRHMBK MIKLKLKGRS 
1741 SSSLQVFCKO DLSSliDVPKI KVHCD. 
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FIG. 8 A * / Human fcaN Nucleotide Sequence 

TCCATTOTCATrGGAATAGCGATTGTGTCATATATrCCAGGAATCACCATCA^ 

TGCGTACCnTCCXjTGTGTTCAGAGCTTTGAAAGCAAmCAGTA 

CGTGGGGGCCTTGCTACGCTCTGTGAAGAAGCTGGTCAACGTGATTATCCTCACCI ici vi iGC 

CTCAGCATCTTrGCCCTGGTAGGTCAGCAGCTCTTCATGGGAAGTCTGAACCTGA^ 

CGAGGQACTGTAAAAATATCAGTAACCCGGAAGCTTATGACCATrGCTTTGAAi^ 

ArrCACCTGAATTCAAAATGTGTGGCATCTGGATGGGTAACAGTGCCTGTTCCATAC^ 

ATGTAAGCACACCAAAATTAATCCTGACTATAATTATACGAATmGACAAC^ 

TTTCITGCCATGTTCCGGCTGATOACCCAAGATTC^ 

GTACTACTGGGCTCTACTCAGTCTTCTTCnTCATO 

TTAACTTAACCCTGGCTGrrGTTA(X;ATGGCATATGAGGAGCAGAA 

AGATAGAQGCrAAGGAAAAGATGTTrCAGGAAGCCCAGCAGCTGTTAAAGGAGGAAAAGGAQ 

GCTCTGGTTGCCATGGGAATTGACAGAAGTTCACTTACTTC^^ 

CAAAAAAGAGAAAGCTCTTTGQTAATAAGAAAAGGAAGTCCrrCTrm 

AAGACCAGCCTCCTGGGTCAGATTCTGATGAAGATTGCCAAAAAAAGCCACAGCTCCT 

AAACCAAACGACTGTCCCAGAATCTATCAYTGQACCACTTrGATGAGCATM 

AAGGCAGAGAGCACTGAGTGCTGTCAQCATCCTCACCATCACCATGAAGGAACAAGAAAAATC 

ACAAGAGCCTTGTCTCCCTTGTGGAGAAAACCTGGCAT^ 

CCCCAQTGGCTGTGajTTAAGAAGGTCCTGAGAACTGTGATGACrGACCCGT^ 

CCATCACCATCTGCATCATCATCAACACTGTCTTCrrGGCCATGG 

CAGTTrrGAGAAGATGTrGAATATAGGGAATTTGGTmCACTAGCAT^^ 

TGCCTAAAAATCATTGCGCTCGATCCCTACCACTACTTTCGCCGAGGCT^ 

GCATTGTTGCTCTrCTGAGTTTTGCAGATGTAATGAACTGTGTAC^ 

ATrCrTGajTTCCTTCAGAGTGCTCAGGGTCTTCA^^ 

ACACTAATTAAGATAATCGGCAACTCTGTCGGAGCCCTTGGAAGCCTGACTGT^^ 

TTGTGATCTTTATmCTCAGTAGTTGGCATGCAGC^^ 

AGTCCAAAACTCTGTAACCCGACAGGCCCGACAGTCrCATGTTTA 

GATTTCrGGCACTCCTTCCTAGTGGTATTCCGCATCCTCTGC^ 

GGGAATGTATGCAAGAAGCGAATGCATCATCATCATTGTGTGTTATTGTOT 

GGTGATAGQAAAACrrGTGGTGCTCAACCTCTTCATTGCCTTACT^ 

GAGOAAAGAAATGGAAACTTAGAAGGAGAGGCCAGGAAAACTAAAGTCCAGTTAGC^CTGGA 
TCGATTCCGCCGGGCrrrTTGTTTTGTGAGACACACTOT 

GGAAGCAAAAOTACrACAGCAAAAAGAGGTGGCAGQAGGCTGTGCTGCACAAAGCAAAG 

ATCATTCCCCTGGTCATGGAGATGAAAAGGGGCTCAGAGACCCAGGAGGAGCTTGGT^^ 

ACCTCrGTACCAAAGACCCTGGGCGTCAGGCATGATTGGACTTGGTTGGCACCACTTGCGGA^ 

GAGGAAGATGACGTTGAATTTTCTGGTGAAGATAATGCACAGCGCATCACACAACCTGAGCCr 

GAACAACAGGCCTATGAGCTCCATCAGGAGAACAAGAAGCCCACGAGCCAGAGAGTTCAAAG 

TGTGGAAATTGACATGTTCrCTGAAGATGAGCCTCATCTGACCATACAGGATCCCCGAAAGAA 

GTCTGATGTTACCAGTATACTATCAGAATGTAGCACCATTGATCTTCAGGATGGCT^ 

TTACCTGAGATGGrrCCCAAAAAGCAACCAGAGAGATGTTTGCCCAAAGGCnTTGGT^^ 

TTCCATGCTGTAGCGTGGACAAGAGAAAGCCTCCCTGGGTCATTTGGTGGAACCTGCGGAAAA 
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CCTGCTACCAAATAGTGAAACACAGCTGGTTTGAGAGCnTAm 
CAQTGGGGCACTGATATrnSAAGATGTTCACCTTGAGAACCAACCCAAA^ 
AAATTGTACTGACAmTTmACACATATTTrr 
TTCGGATTT^AAAGTATTTCACCAGTGCCTGG^^ 

TGTGACCACCCTCATTAACTTAATGGAATTGAAGTCCTTCCGGACTCTACG^ 
CTTCGTGCGCTGTCCCAGTrrGAAGGAATGAAGGTGGTGGTCAATGCrCTCATAGGTG 
CTGCCATTCTGAATGTTTTGCTTGTCTGCCTCATT^ 
ACTTCTTTrCTGGAAAATrrGGGAAATGCATT/^^ 

CATTACAAATAAAAGTCAATGTGAAAGTGGCAATTTCTCTrGGATCAAaiAGAAA 
GACAATGTGGGAAATGCTTACCTCGCrCTGCTOCAAO 
ATTATATATGCAGCTGTTGATTCCACAGAOAAAGAACAACAGCCAGAGT^ 
CTCGQTTACArrTACTTCGTAGTCTTrATCATC^ 

GGCGTTATCATTGACAACITCAACCAACAGCAGAAAAAGTTAGGTGGC^ 

ACAGAAGAACAGAAGAAATACTATAATGCAATGAAAAAATTAGGATCCl^AAAAAC^ 

ACCCATTCCACGGCXTCTGAACAAATGTCAAGGTCrCGTGTrCGACATA 

TirGACATCATCATCATAAGTCTCATTATCCTAAACATGATrAGCATGATGG^ 

ACCAACCCAAAGCCATGAAATaL\TCCrrGACCATCTCAACTGGGTCm 

GmGAATXn'CTCATCAAAATCrrrrGCriTGAGGC^ 

TTTGA 
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FIG. 8B 

Partial Human NaNAmino Acid Sequence 
SI VIGIAIVSYI PGITIKLLPL RTFRVFRALK AISWSRLKV IVGALLRSVK KLVNVIILTF 
FCLSIFALVG QQLFMGSLNL KCISRDCKNI SNPEAYDHCF EKKENSPEFK MCGIWMGNSA 
CSIQYECKHT KINPDYNYTN FDNFGWSFLA MFRLMTQDSW EKLYQQTLRT TGLYSVPFH 
WIFLGSFYL INLTLAWTM AYEEQNKNVA AEIEAKEKMF QEAQQLLKEE KEALVAMGId 
RSSLTSLETS YFTPKKRKLF GNKKRKSFFL RESGKOQPPG SDSDEDCQKK PQLLEQTKRL 
SQNLSLDHFD EHGDPLQRQR ALS AVSILTI TMKEQEKSQE PCLPCX3ENLA SKYLVWNCCP 
QWLCVKKVLR TVMTDPFTEL AmcmNT VFLAMEHHKM EASFEKXfLNI GNLVFTSIFI 
AEMCLKIIAL DPYHYFRRGW NIFDSIVALL SFADVMNCVL QKRSWPFLRS 
FRVLRVFKLAKSWPTUmi lOIGNSVGAL GSL1WLVIV IFEFS WGMQ LFGRSFNSQK 
SPKLCNPTGP TVSCLRHWHM GDFWHSFL W FRILCGEWIE NMWECMQEAN ASSSLCVIVF 

arrviGKLV vlnlfialll nsfsneernq nlegearktx vqialdrhui afcfvrhtle 
iifchkwcrkq nlpqqkevag gcaaqskdh plvmemkrgs etqeelgilt svpktlgvrh 
dwtwlaplae eeddvefsge dnaqritqpe peqqayelhq enkkptsqrvqsveidmfsb 
depiiltiqdp riocsdvtsil secsttolqd gfgwlpemvp kkqperclpk gfgccfpcds 

VDKRKPPWVI WWNLRKTCVQ IVKHSWFESF IIFVILLSSG ALIFEDVHLE NQPiUQELLN 
CTDnrnnF ILE^fVLKWVA FGFGKYFTSA WCCLDFnVI VSVTTLINLM ELKSFRTLRA 
LRPLRALSQF EGMKWVNAL IGAfPAILNV LLVCLIFWLV FOLGVYFFS GKFGKONGT 
DSVINYTirrNKSQCESGNF SWINQKVNFD NVGNAYLALL QVATFKGWMD IIYAAVDSTE 
KEQQPEFESN SLQYIYFWF nFGSFFTLN LFIGViroNF NQQQKKLGGQ DIFMTEEQKK 
YVNAWKKLGS KKPQKPIPRP LhfKCQGLVFD IVTSQFFDn HSLIILNMI SMMAESYNQP 
KAMKSILDHL NWVFWIFTL ECLIKIFALR QYYFTNGWNL FDCVWLLSIV 
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FIG, 9 

NaN immunostaining in DRG neurons 



c : 
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FIG. 10A 
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FIG. II A 

Sequence o€ human NaN cDNA. Open reading frame (cdc) 

Extends from position 31 (ATG) to the termination codon TGA at 

Position 5400. 

1 ATCTGCTCAA GCCAGGAATC TCGGGTGAAG ATGGATGACA GATGCTACCC 
51 AGTAATCTTT CCAGATGAGC GGAATTTCCG CCCCTTCACT TCCGACTCTC 
101 TGGCTGCAAT TGAGAAGCGG ATTGCCATCC AAAAGGAGAA AAAGAAGTCT 
151 AAAGACCAGA CAGGAGAAGT ACCCCAGCCT CGGCCTCAGC TTGACCTAAA 
201 GGCCTCCAGG AAGTTGCCCA AGCTCTATGG CGACATTCCT CGTGAGCTCA 
251 TAGGAAAGCC TCTGGAAGAC TTGGACCCAT TCTACCGAAA TCATAAGACA 
301 TTTATGGTGT TAAACAGAAA GAGGACAATC TACCGCTTCA GTGCCAAGCA 
351 TGCCTTGTTC ATTTTTGGGC CTTTCAATTC AATCAGAAGT TTAGCCATTA 
401 GAGTCTCAGT CCATTCATTG TTCAGCATGT TCATTATCGG CACCGTTATC 
451 ATCAACTGCG TGTTCATGGC TACAGGGCCT GCTAAAAACA GCAACAGTAA 
501 CAATACTGAC ATTGCAGAGT GTGTCTTCAC TGGGATTTAT ATTTTTGAAG 
551 CTTTGATTAA AATATTGGCA AGAGGTTTCA TTCTGGATGA GTTTTCTTTC 
601 CTTCGAGATC CATGGAACTG GCTGGACTCC ATTGTCATTG GAATAGCGAT 
651 TGTGTCATAT ATTCCAGGAA TCACCATCAA ACTATTGCCC CTGCGTACCT 
701 TCCGTGTGTT CAGAGCTTTG AAAGCAATTT CAGTAGTTTC ACGTCTGAAG 
751 GTCATCGTGG GGGCCTTGCT ACGCTCTGTG AAGAAGCTGG TCAACGTGAT 
801 TATCCTCACC TTCTTTTGCC TCAGCATCTT TGCCCTGGTA GGTCAGCAGC 
851 TCTTCATGGG AAGTCTGAAC CTGAAATGCA TCTCGAGGGA CTGTAAAAAT 
901 ATCAGTAACC CGGAAGCTTA TGACCATTGC TTTGAAAAGA AAGAAAATTC 
951 ACCTGAATTC AAAATGTGTG GCATCTGGAT GGGTAACAGT GCCTGTTCCA 
1001 TACAATATGA ATGTAAGCAC ACCAAAATTA ATCCTGACTA TAATTATACG 
1051 AATTTTGACA ACTTTGGCTG GTCTTTTCTT GCCATGTTCC GGCTGATGAC 
1101 CCAAGATTCC TGGGAGAAGC TTTATCAACA GACCCTGCGT ACTACTGGGC 
1151 TCTACTCAGT CTTCTTCTTC ATTGTGGTCA TTTTCCTGGG CTCCTTCTAC 
1201 CTGATTAACT TAACCCTGGC TGTTGTTACC ATGGCATATG AGGAGCAGAA 
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12 SI CAAGAATGTA GCTGCAGAGA TAGAGGCCAA GGAAAAGATG TTTCAGGAAG 

1301 CCCAGCAGCT GTTAAAGGAG GAAAAGGAGG CTCTGGTTGC CATGGGAATT 

1351 GACAGAAGTT CACTTACTTC CCTTGAAACA TCATATTTTA CCCCAAAAAA 

1401 GAGAAAGCTC TTTGGTAATA AGAAAAGGAA GTCCTTCTTT TTGAGAGAGT 

1451 CTGGGAAAGA CCAGCCTCCT GGGT.CAGATT CTGATGAAGA TTGCCAAAAA 

1501 AAGCCACAGC TCCTAGAGCA AACCAAACGA CTGTCCCAGA ATCTATCACT 

1551 GGACCACTTT GATGAGCATG GAGATCCTCT CCAAAGGCAG AGAGCACTGA 

1601 GTGCTGTCAG CATCCTCACC ATCACCATGA AGGAACAAGA AAAATCACAA 

1651 GAGCCTTGTC TCCCTTGTGG AGAAAACCTG GCATCCAAGT ACCTCGTGTG 

1701 GAACTGTTGC CCCCAGTGGC TGTGCGTTAA GAAGGTCCTG AGAACTGTGA 

1751 TGACTGACCC GTTTACTGAG CTGGCCATCA CCATCTGCAT CATCATCAAC 

1801 ACTGTCTTCT TGGCCATGGA GCATGACAAG ATGGAGGCCA GTTTTGAGAA 

1851 GATGTTGAAT ATAGGGAATT TGGTTTTCAC TAGCATTTTT ATAGCAGAAA 

1901 TGTGCCTAAA AATCATtGCG CTCGATCCCT ACCACTACTT TCGCCGAGGC 

1951 TGGAACATTT TTGACAGCAT TGTTGCTCTT CTGAGTTTTG CAGATGTAAT 

2001 GAACTGTGTA CTTCAAAAGA GAAGCTGGCC ATTCTTGCGT TCCTTCAGAG 

2051 TGCrCAGGGT CTTCAAGTTA GCCAAATCCT GGCCAACTTT GAACACACTA 

2101 ATTAAGATAA TCGGCAACTC TGTCGGAGCC CTTGGAAGCC TGACTGTGGT 

2151 CCTGGTCATT GTGATCTTTA TTTTCTCAGT AGTTGGCATG CAGCTTTTTG 

2201 GCCGTAGCTT CAATTCCCAA AAGAGtCCAA AACTCTGTAA CCCGACAGGC 

2251 CCGACAGTCT CATGTTTACG GCACTGGCAC ATGGGGGATT TCTGGCACTC 

2301 CTTCCTAGTG GTATTCCGCA TCCTCTGCGG GGAATGGATC GAAAATATGT 

2351 GGGAATGTAT GCAAGAAGCG AATGCATCAT CATCATTGTG TGTTATTGTC 

2401 TTCATATTGA TCACGGTGAT AGGAAAACTT GTGGTGCTCA ACCTCTTCAT 

2451 TGCCTTACTG CTCAATTCCT TTAGCAATGA GGAAAGAAAT GGAAACTTAG 

2501 AAGGAGAGGC CAGGAAAACT AAAGTCCAGT tAGCACTGGA TCGATTCCGC 

2551 CGGGCTTTTT GTTTTGTGAG ACACACTCTT GAGCATTTCT GTCACAAGTG 
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FIG. UA-3 



2601 


GTGCAGGAAG 


CAAAACTTAC 


CACAGCAAAA 


AGAGGTGGCA 


GGAGGCTGTG 


2651 


CTGCACAAAG 


CAAAGACATC 


ATTCCCCTGG 


TCATGGAGAT 


GAAAAGGGGC 


2701 


TCAGAGACCC 


AGGAGGAGCT 


TGGTATACTA 


ACCTCTGTAC 


CAAAGACCCT 


2751 


GGGCGTCAGG 


CATGATTGGA 


CTTGGTTGGC 


ACCACTTGCG 


GAGGAGGAAG 


2801 


ATGACGtTGA 


ATTTTCTGGT 


GAAGATAATG 


CACAGCGCAT 


CACACAACCT 


2851 


GAGCCTGAAC 


AACAGGCCTA 


TGAGCTCCAT 


CAGGAGAACA 


AGAAGCCCAC 


2901 


GAGCCAGAGA 


GTTCAAAGTG 


TGGAAATTGA 


CATGTTCrCT 


GAAGATGAGC 


2951 


CTCATCTGAC 


CATACAGGAT 


CCCCX^AAAGA 


AGTCTGATGT 


TACCAGTATA 


3001 


CTATCAGAAT 


GTAGCACCAT 


TGATCTTCAG 


GATGGCTTTG 


GATGGTTACC 


3051 


TGAGATGGTT 


CCCAAAAAGC 


AACCAGAGAG 


ATGTTTGCCC 


AAAGGCTTTG 


3101 


GTTGCTGCTT 


TCCATGCTGT 


AGCGTGGACA 


AGAGAAAGCC 


TCCCTGGGTC 


3151 


ATTTGGTGGA 


ACCTGCGGAA 


AACCTGCTAC 


CAAATAGTGA 


AACACAGCTG 


3201 


GTTTGAGAGC 


TTTATTATCT 


TTGTGATTCT 


GCTGAGCAGT 


GGGGCACTGA 


3251 


TATTTGAAGA 


TGTTCACCTT 


GAGAACCAAC 


CCAAAATCCA 


AGAATTACTA 


3301 


AATTGTACTG 


ACATTATTTT 


TACACATATT 


TTTATCCTGG 


AGATGGTACT 


3351 


AAAATGGGTA 


GCCTTCGGAT 


TTGGAAAGTA 


TTTCACCAGT 


GCCTGGTGCT 


3401 


GCCTTGATTT 


CATCATTGTG 


ATTGTCTCTG 


TGACCACCCT 


CATTAACTTA 


3451 


ATGGAATTGA 


AGTCCTTCCG 


GACTCTACGA 


GCACTGAGGC 


CTCTTCGTGC 


3501 


GCTGTCCCAG 


TTTGAAGGAA 


TGAAGGTGGT 


GGTCAATGCT 


CTCATAGGTG 


3551 


CCATACCTGC 


CATTCTGAAT 


GTTTTGCTTG 


TCTGCCTCAT 


TTTCTGGCTC 


3601 


GTATTTTGTA 


TTCTGGGAGT 


ATACTTCTTT 


TCTGGAAAAT 


TTGGGAAATG 


3651 


CATTAATGGA 


ACAGACTCAG 


TTATAAATTA 


TACCATCATT 


ACAAATAAAA 


3701 


GTCAATGTGA 


AAGTQGCAAT 


TTCTCTTGGA 


TCAACCAGAA 


AGTCAACTTT 


3751 


GACAATGTGG 


GAAATGCTTA 


CCTCGCTCTG 


CTGCAAGTGG 


CAACATTTAA 


3801 


GGGCTGGATG 


GATATTATAT 


ATGCAGCTGT 


TGATTCCACA 


GAGAAAGAAC 


3851 


AACAGCCAGA 


GTTTGAGAGC 


AATTCACTCG 


GTTACATTTA 


CTTCGTAGTC 


3901 


TTTATCATCT 


TTGGCTCATT 


CTTCACTCTG 


AATCTCTTCA 


TTGGCGTTAT 


3951 


CATTGACAAC 


TTCAACCAAC 


AGCAGAAAAA 


GTTAGGTGGC 


CAAGACATTT 
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FIG. nA-4 

4001 TTATGACAGA AGAACAGAAG AAATACTATA ATGCAATGAA AAAATTAGGA 

4051 TCCAAAAAAC CTCAAAAACC CATTCCACGG CCTCTGAACA AATGTCAAGG 

4101 TCTCGTGTTC GACATAGTCA CAAGCCAGAT CTTTGACATC ATCATCATAA 

4151 GTCTCATTAT CCTAAACATG ATTAGCATGA TGGCTGAATC ATACAACCAA 

4201 CCCAAAGCCA TGAAATCCAT CCTTGACCAT CTCAACTGGG TCTTTGTGGT 

4251 CATCTTTACG TTAGAATGTC TCATCAAAAT CTTTGCTTTG AGGCAATACT 

4301 ACTTCACCAA TGGCTGGAAT TTATTTGACT GTGTGGTCGT GCTTCTTTCC 

4351 ATTGTTAGTA CAATGATTTC TACCTTGGAA AATCAGGAGC ACATTCCTTT 

4401 CCCTCCGACX3 CTCTTCAGAA TTGTCCGCTT GGCTCGGATT GGCCGAATCC 

4451 TGAGGCTTGT CCGGGCTGCA CGAGGAATCA GGACTCTCCT CTTTGCTCTG 

4501 ATGATGTCGC TTCCTTCTCT GTTCAACATT GGTCTTCTAC TCTTTCTGAT 

4551 TATGTTTATC TATGCCATTC TGGGTATGAA CTGGTTTTCC AAAGTGAATC 

4601 CAGAGTCTGG AATCGATGAC ATATTCAACT TCAAGACTTT TGCCAGCAGC 

4651 ATGCTCTGTC TCTTCCAGAT AAGCACATCA GCAGGTTGGG ATTCCCTGCT 

4701 CAGCCCCATG CTGCGATCAA AAGAATCATG TAACTCTTCC TCAGAAAACT 

4751 GCCACCTCCC TGGCATAGCC ACATCCTACT TTGTCAGTTA CATTATCATC 

4801 TCCTTTCTCA TTGTTGTCAA CATGTACATT GCTGTGATTT TAGAGAACTT 

4851 CAATACAGCC ACTGAAGAAA GTGAGGACCC TTTGGGTGAA GATGACTTTG 

4901 ACATATTTTA TGAAGTGTGG GAAAAGTTTG ACCCAGAAGC AACACAATTT 

4951 ATCAAATATT CTGCCCTTTC TGACTTTGCT GATGCCTTGC CTGAGCCTTT 

5001 GCGTGTCGCA AAGCCAAATA AATATCAATT TCTAGTAATG GACTTGCCCA 

5051 TGGTGAGTGA AGATCGCCTC CACTGCATGG ATATTCTTTT CGCCTTCACC 

5101 GCTAGGGTAC TCX3GTGGCTC TGATGGCCTA GATAGTATGA AAGCAATGAT 

5151 GGAAGAGAAG TTCATGGAAG CCAATCCTCT CAAGAAGTTG TATGAACCCA 

5201 TAGTCACCAC CACCAAGAGA AAGGAAGAGG AAAGAGGTGC TGCTATTATT 

5251 CAAAAGGCCT TTCGAAAGTA CATGATGAAG GTGACCAAGG GTGACCAAGG 

5301 TGACCAAAAT GACTTGGAAA ACGGGCCTCA TTCACCACTC CAGACTCTTT 
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FIG. IIA-5 

S3 51 GCAATGGAGA CTTGTCTAGC TTTGGGGTGG CCAAGGGCAA GGTCCACTGT 

5401 GACTGAGCCC TCACCTCCAC GCCTACCTCA TAGCTTCACA GCCTTGCCTT 

5451 CAGCCTCTGA GCTCCAGGGG TCAGCAGCTT AGTGTATCAA CAGGGAGTGG 

5501 ATTCACCAAA TTAGCCATTC CATTTTCTTT TCTGGCTAAA ATAAATGATA 

5551 TTTCAATTTC ATTTTAAATG ATACTTACAQ AGATATAAGA TAAGGCTACT 

5601 TGACAACCAG TGGTACTATT ATAATAAGGA AGAAGACACC AGGAAGGACT 

5651 GTAAAAGGAC ATACCAATTT TAGGATTGAA ATAGTTCAGG CCGGGCGCAG 

5701 TGGCTCATGC CTGTAATCCC AGCACTTTGA GAGGCCAAGG CAGGTGGATC 

5751 ACGAGGTCAA GAGATCGAGA CCATCCTGGC CAAGATGATG AAACTCCGTC 

5801 TCTCTAAAAA TACAAAAATT AGCTGGGCAT GGTGGCGTGC GCCTGTAGTC 

5851 CCACTACTTG 
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FIG. I IB 

Sequence of human NaR. 



1 


MDDRCYPVIF 


PDERNFRPFT SDSLAAIEKR 


lAIQKEKKKS 


KDQTGEVPQP 


51 


RPQLDLKASR 


KLPKLYGDIP RELIGKPLED 


LDPFYRJtmiCr 


FMVLNRKRTI 


101 


YRFSAKHALF 


IFGPFNSIRS LAIRVSVHSL 


FSMFIIGTVI 


INCVFMATGP 


151 


AKNSMSNNTD 


lAECVFTGIY IFEALIKILA 


RGFILDEFSF 


LRDPWNWLDS 


201 


IVIGIAIVSY 


IPGITIKLLP LRTFRVFRAL 


KAISWSRLK 


VIVGAIiLRSV 


251 


KKLVNVIILT 


FFCLSIFALV GQQLFMGSLN 


LKCISRDCKN 


ISNPEAYDHC 


301 


FEKKENSPEF 


KMCGIWMGNS ACSIQYECKH 


TKINPDYNYT 


NFDNFGWSFL 


351 


AMFRLMTQDS 


WEKLYQQTLR TTGLYSVFFF 


IWIFLGSFY 


LINLTLAWT 


401 


MAYEEQNKNV 


AAEIEAKEKM FQEAQQLLKE 


EKEALVAMGI 


DRSSLTSLET 


451 


SYFTPKKRKL 


FGNKKRKSFF LRESGKDQPP 


GSDSDEDCQK 


KPQLLEQTKR 


501 


LSQNLSI.DHF 


DEHGDPLQRQ RALSAVSILT 


ITMKEQEKSQ 


EPCLPCGENL 


551 


ASKYLVWNCC 


PQWLCVKKVL RTVMTDPFTE 


LAITICIIIN 


TVFLAMEHHK 


601 


MEASFEKMLN 


IGNLVFTSIF lAEMCLKIIA 


LDPYHYFRRG 


WNIFDSIVAL 


651 


LSFADVMNCV 


LQKRSWPFLR SFRVLRVFKL 


AKSWPTLNTL 


IKIIGNSVGA 


701 


LGSLTWLVr 


VIFIFSWGM QLFGRSFMSQ 


KSPKLCNPTG 


PTVSCLRHWH 


751 


MGDFWHSFLV 


VFRILCGEWI ENMWECMQEA 


NASSSLCVIV 


FILITVIGKL 


801 


WLNLFIALL 


LNSFSNEERN GNLEGEARKT 


KVQLALDRFR 


RAFCFVRHTL 


851 


EHFCHKWCRK 


QNLPQQKEVA GGCAAQSKDI 


IPLVMEMKRG 


SETQEELGIL 


901 


TSVPKTLGVR 


HDWTWLAPLA EEEDDVEFSG 


EDNAQRITQP 


EPEQQAYELK 


951 


QENKKPTSQR 


VQSVEIDMFS EDEPHLTIQD 


PRKKSDVTSI 


LSECSTIDLQ 


1001 


DGFGWLPEMV 


PKKQPERCLP KGFGCCFPCC 


SVDKRKPPWV 


IWWNLRKTCY 


1051 


QIVKHSWFES 


FIIFVILLSS GALIFEDVHL 


ENQPKIQELL 


NCTDIIFTHI 


1101 


FILEMVLKWV 


AFGFGKYFTS AWCCLDFIIV 


IVSVTTLINL 


MELKSFRTLR 


1151 


AIjRPIiRALSQ 


FEGMKVWNA LIGAIPAILN 


VLLVCLIFWIi 


VFCILGVYFF 
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1201 


SGKFGKCING 


TDSVINYTII 


TNKSQCESGN 


FSWINQKVNF 


DNVGNAYIiAL 


1251 


LQVATFKGWM 


DIIYAAVDST 


EKEQQPEFES 


NSLGYIYFW 


FIIFGSFFTL 


1301 


NliFIGVXIDN 


FNQQQKKLGG 


QDIFMTEEQK 


KYYNAMKKLG 


SKKPQKPIPR 


1351 


PLNKCQGLVF 


DIVTSQIFDI 


IIISLIILNM 


ISMMAESYNQ 


PKAMKSILDH 


1401 


LNWVFWIFT 


LBCLIKIFAL 


RQYYFTNGWN 


LFDCWVLLS 


IVSTMISTLE 


1451 


NQEHIPFPPT 


LFRIVRIARI 


GRILRLVRAA 


RGIRTLLPAL, 


MMSLPSLFNI 


1501 


GliLLFLIMFI 


YAILGMNWFS 


KVNPESGIDD 


IFNFKTFASS 


MLCLFQISTS 


1551 


AGWDSLLSPM 


LRSKESCNSS 


SENCHLPGIA 


TSYFVSYIII 


SFLIWNMYI 


1^01 


AVILENFOTA 


TEESEDPLGE 


DDFDIFYEVW 


EKFDPEATQF 


IKYSALSDFA 


1651 


DALPEPLRVA 


KPNKYQFLVM 


DLPMVSEDRIi 


HCMDILFAFT 


ARVLGGSDGL 


1701 


DSMKAMMEEK 


FMEANPLKKL 


YEPIVTTTKR 


KEEERGAAII 


QKAFRKYMMK 


1751 


VTKGDQGDQN 


DLENGPHSPL 


QTLCNGDLSS 


FGVAKGKVHC 
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<400> 38 

Met Trp Xaa Cys Met Glu Val 
1 5 



<210> 39 
<21X> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: rat NaN 
forward primer 

<400> 39 

ccctgctgcg ctcggtgaag aa 22 

<210> 40 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: a. a. seq. 
used to derive epitope for polyclonal antibody 

<400> 40 

Cys Gly Pro Asn Pro Ala Ser Asn Lys Asp Cys Phe Glu Lys Glu Lys 
1 5 10 15 

Asp Ser Glu Asp 
20 



<210> 41 

<211> 5660 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (31) . . (5403) 

<223> full length cDKA sequence for human NaN 
<400> 41 

atctgctcaa gccaggaatc tcgggtgaag atg gat gac aga tgc tac cca gta 54 

Met Asp Asp Arg Cys Tyr Pro Val 
1 5 

ate ttt cca gat gag egg aat ttc cgc ccc ttc act tec gac tct ctg 102 
He Phe Pro Asp Glu Arg Asn Phe Arg Pro Phe Thr Ser Asp Ser Leu 
10 15 20 

get gca att gag aag egg att gee ate caa aag gag aaa aag aag tct 150 
Ala Ala He Glu Lys Arg He Ala He Gin Lys Glu Lys Lys Lys Ser 
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25 30 35 40 

aaa gac cag aca gga gaa gta ccc cag cct egg cct cag ctt gac eta 198 
Lys Asp Gin Thr Gly Glu Val Pro Gin Pro Arg Pro Gin Leu Asp Leu 
45 50 55 

aag gcc tec agg aag ttg ccc aag etc tat ggc gac att cct cgt gag 246 
Lys Ala Ser Arg Lys Leu Pro Lys Leu Tyr Gly Asp lie Pro Arg Glu 
60 65 70 

etc ata gga aag cct ctg gaa gac ttg gac cca tte tac ega aat eat 294 
Leu lie Gly Lys Pro Leu Glu Asp Leu Asp Pro Phe Tyr Arg Asn His 
75 80 85 

aag aca ttt atg gtg tta aac aga aag agg aca ate tac cgc tte agt 342 
Lys Thr Phe Met Val Leu Asn Arg Lys Arg Thr lie Tyr Arg Phe Ser 
90 95 100 

gee aag cat gcc ttg tte att ttt ggg cct tte aat tea ate aga agt 390 
Ala Lys His Ala Leu Phe lie Phe Gly Pro Phe Asn Ser lie Arg Ser 
105 110 115 120 

tta gcc att aga gtc tea gtc cat tea ttg tte age atg tte att ate 438 
Leu Ala lie Arg Val Ser Val His Ser Leu Phe Ser Net Phe He He 
125 130 135 

ggc acc gtt ate ate aac tgc gtg tte atg get aca ggg cct get aaa 486 
Gly Thr Val He He Asn Cys Val Phe Met Ala Thr Gly Pro Ala Lys 
140 145 150 

aac age aac agt aac aat act gac att gca gag tgt gtc tte act ggg 534 
Asn Ser Asn Ser Asn Asn Thr Asp He Ala Glu Cys Val Phe Thr Gly 
155 160 165 

att tat att ttt gaa get ttg att aaa ata ttg gca aga ggt tte att 582 
He Tyr He Phe Glu Ala Leu He Lys He Leu Ala Arg Gly Phe He 
170 175 180 

ctg gat gag ttt tct tte ctt cga gat cca tgg aac tgg ctg gac tec 630 
Leu Asp Glu Phe Ser Phe Leu Arg Asp Pro Trp Asn Trp Leu Asp Ser 
185 190 195 200 

att gtc att gga ata geg att gtg tea tat att cca gga ate ace ate 678 
He Val He Gly He Ala He Val Ser Tyr He Pro Gly He Thr He 
205 210 215 

aaa eta ttg ccc ctg cgt ace tte cgt gtg tte aga get ttg aaa gca 726 
Lys Leu Leu Pro Leu Arg Thr Phe Arg Val Phe Arg Ala Leu Lys Ala 
220 225 230 

att tea gta gtt tea cgt ctg aag gtc ate gtg ggg gcc ttg eta cgc 774 
He Ser Val Val Ser Arg Leu Lys Val He Val Gly Ala Leu Leu Arg 
235 240 245 

tct gtg aag aag ctg gtc aac gtg att ate etc ace tte ttt tgc etc 822 
Ser Val Lys Lys Leu Val Asn Val He He Leu Thr Phe Phe Cys Leu 
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250 255 260 

age ate ttt gcc ctg gta ggt cag cag etc ttc atg gga agt etg aae 870 
Ser IXe Phe Ala Xieu Val Gly Gin Gin Leu Phe Met Gly Ser Leu Asn 
265 270 275 280 

ctg aaa tgc ate teg agg gac tgt aaa aat ate agt aae eeg gaa get 918 
Leu Lys Cys He Ser Arg Asp Cys Lys Asn He Ser Asn Pro Glu Ala 
285 290 295 

tat gac eat tgc ttt gaa aag aaa gaa aat tea cat gaa ttc aaa atg 966 
Tyr Asp His Cys Phe Glu Lys Lys Glu Asn Ser Pro Glu Phe Lys Met 
300 305 310 

tgt ggc ate tgg atg ggt aae agt gee tgt tec ata eaa tat gaa tgt 1014 
Cys Gly He Trp Met Gly Asn Ser Ala Cys Ser He Gin Tyr Glu Cys 
315 320 325 

aag cae ace aaa att aat eet gac tat aat tat acg aat ttt gac aae 1062 
Lys His Thr Lys He Asn Pro Asp Tyr Asn Tyr Thr Asn Phe Asp Asn 
330 335 340 

ttt ggc tgg tct ttt ctt gcc atg ttc egg ctg atg acc eaa gat tec 1110 
Phe Gly Trp Ser Phe Leu Ala Met Phe Arg Leu Met Thr Gin Asp Ser 
345 350 355 360 

^99 9^9 d^g ett tat eaa cag acc ctg egt act act ggg etc tac tea 1158 
Trp Glu Lys Leu Tyr Gin Gin Thr Leu Arg Thr Thr Gly Leu Tyr Ser 
365 370 375 

gtc ttc ttc tte att gtg gte att ttc etg ggc tec ttc tac ctg att 1206 
Val Phe Phe Phe He Val Val He Phe Leu Gly Ser Phe Tyr Leu He 
380 385 390 

aae tta ace etg get gtt gtt acc atg gca tat gag gag cag aac aag 1254 
, Asn Leu Thr Leu Ala Val Val Thr Met Ala Tyr Glu Glu Gin Asn Lys 
395 400 405 

aat gta get gca gag ata gag gee aag gaa aag atg ttt cag gaa gee 1302 
Asn Val Ala Ala Glu He Glu Ala Lys Glu Lys Met Phe Gin Glu Ala 
410 415 420 

cag cag ctg tta aag gag gaa aag gag get etg gtt gee atg gga att 1350 
Gin Gin Leu Leu Lys Glu Glu Lys Glu Ala Leu Val Ala Met Gly He 
425 43.0 435 440 

gac aga agt tea ctt act tec ctt gaa aca tea tat ttt acc cca aaa 1398 
Asp Arg Ser Ser Leu Thr Ser Leu Glu Thr Ser Tyr Phe Thr Pro Lys 
445 450 455 

aag aga aag etc ttt ggt aat aag aaa agg aag tec ttc ttt ttg aga 1446 
Lys Arg Lys Leu Phe Gly Asn Lys Lys Arg Lys Ser Phe Phe Leu Arg 
460 465 470 

gag tct ggg aaa gac cag eet ect ggg tea gat tct gat gaa gat tgc 1494 
Glu Ser Gly Lys Asp Gin Pro Pro Gly Ser Asp Ser Asp Glu Asp Cys 
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475 480 485 

caa aaa aag cca cag etc eta gag caa aec aaa cga ctg tec cag aat XS42 
Gin Lys Lys Pro Gin Leu Leu Glu Gin Thr Lys Arg Leu Ser Gin Asn 
490 495 500 

eta tea ctg gac cac ttt gat gag cat gga gat cct etc caa agg cag 1590 
Leu Ser Leu Asp His Phe Asp Glu His Gly Asp Pro Leu Glii Arg Gin 
505 510 515 520 

aga gca ctg agt get gtc age ate etc ace ate ace atg aag gaa caa 1638 
Arg Ala Leu Ser Ala Val Ser lie Leu Thr lie Thr Met Lys Glu Gin 
525 530 535 

gaa aaa tea caa gag cct tgt etc cct tgt gga gaa aac ctg gca tec 1686 
Glu Lys Ser Gin Glu Pro Cys Leu Pro Cys Gly Glu Asn Leu Ala Ser 
540 545 550 

aag tac etc gtg tgg aac tgt tgc cec cag tgg ctg tgc gtt aag aag 1734 
Lys Tyr Leu Val Trp Asn Cys Cys Pro Gin Trp Leu Cys Val Lys Lys 

555 560 565 

gtc Ctg aga act gtg atg act gac ccg ttt act gag ctg gee ate ace 1782 
Val Leu Arg Thr Val Met Thr Asp Pro Phe Thr Glu Leu Ala He Thr 
570 575 580 

ate tgc ate ate ate aac act gtc ttc ttg gee atg gag cat cac aag 1830 
He Cys He lie ile Asn Thr Val Phe Leu Ala Met Glu His His Lys 
585 590 595 600 

atg gag gee agt ttt gag aag atg ttg aat ata ggg aat ttg gtt ttc 1878 
Met Glu Ala Ser Phe Glu Lys Met Leu Asn He Gly Asn Leu Val Phe 
605 610 615 

act age att ttt ata gca gaa atg tgc eta aaa ate att gcg etc gat 1926 
Thr Ser He Phe He Ala Glu Met Cys Leu Lys He He Ala Leu Asp 
620 625 630 

cec tac cac tac ttt cgc cga ggc tgg aac att ttt gac age att gtt 1974 
Pro Tyr His Tyr Phe Arg Arg Gly Trp Asn He Phe Asp Ser He Val 
635 640 645 

get ctt Ctg agt ttt gca gat gta atg aac tgt gta ctt caa aag aga 2022 
Ala Leu Leu Ser Phe Ala Asp Val Met Asn Cys Val Leu Gin Lys Arg 
650 655 660 

age tgg cca ttc ttg cgt tec ttc aga gtg etc agg gtc ttc aag tta 2070 
Ser Trp Pro Phe Leu Arg Ser Phe Arg Val Leu Arg Val Phe Lys Leu 
665 670 675 680 

gcc aaa tec tgg cca act ttg aac aca eta att aag ata ate ggc aac 2118 
Ala Lys Ser Trp Pro Thr Leu Asn Thr Leu He Lys He He Gly Asn 
685 690 695 

tet gtc gga gcc ctt gga age ctg act gtg gtc ctg gtc att gtg ate 2166 
Ser Val Gly Ala Leu Gly Ser Leu Thr Val Val Leu Val He Val He 
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700 705 710 

ttt att ttc tea gta gtt ggc atg cag ctt ttt ggc cgt age ttc aat 2214 
Phe lie Phe Ser Val Val Gly Met Gin Leu Phe Gly Arg Ser Phe Asn 
715 720 725 

tec caa aag agt cca aaa etc tgt aac ceg aca ggc ccg aca gtc tea 2262 
Ser Gin Lys Ser Pro Lys Leu Cys Asn Pro Thr Gly Pro Thr Val Ser 
730 735 740 

tgt tta egg eac tgg cae atg ggg gat ttc tgg eac tec ttc eta gtg 2310 
Cys Leu Arg His Trp His Met Gly Asp Phe Trp His Ser Phe Leu Val 
745 750 755 760 

gta ttc ego ate etc tgc ggg gaa tgg ate gaa aat atg tgg gaa tgt 2358 
Val Phe Arg lie Leu Cys Gly Glu Trp lie Glu Asn Met Trp Glu Cys 
765 770 775 

atg caa gaa geg aat gca tea tea tea ttg tgt gtt att gtc ttc ata 2406 
Met Gin Glu Ala Asn Ala Ser Ser Ser Leu Cys Val lie Val Phe lie 
780 785 790 

ttg ate acg gtg ata gga aaa ctt gtg gtg etc aac etc ttc att gcc 2454 
Leu lie Thr Val lie Gly Lys Leu Val Val Leu Asn Leu Phe lie Ala 
795 800 805 

tta etg etc aat tec ttt age aat gag gaa aga aat gga aac tta gaa 2502 
Leu Leu Leu Asn Ser Phe Ser Asn Glu Glu Arg Asn Gly Asn Leu Glu 
810 815 820 

gga gag gee agg aaa act aaa gtc cag tta gca etg gat ega ttc cge 2550 
Gly Glu Ala Arg Lys Thr Lys Val Gin Leu Ala Leu Asp Arg Phe Arg 
825 830 835 840 

egg get ttt tgt ttt gtg aga eac act ctt gag cat ttc tgt cac aag 2598 
Arg Ala Phe Cys Phe Val Arg His Thr Leu Glu His Phe Cys His Lys 
845 850 855 

tgg tgc agg aag caa aac tta cca cag caa aaa gag gtg gca gga ggc 2646 
Trp Cys Arg Lys Gin Asn Leu Pro Gin Gin Lys Glu Val Ala Gly Gly 
860 865 870 

tgt get gca caa age aaa gac ate att ece etg gtc atg gag atg aaa 2694 
Cys Ala Ala Gin Ser Lys Asp lie lie Pro Leu Val Met Glu Met Lys 
875 880 885 

agg ggc tea gag ace eag gag gag ctt ggt ata eta acc tct gta cca 2742 
Arg Gly Ser Glu Thr Gin Glu Glu Leu Gly lie Leu Thr Ser Val Pro 
890 895 900 

aag acc etg ggc gtc agg cat gat tgg act tgg ttg gca cca ctt gcg 2790 
Lys Thr Leu Gly Val Arg His Asp Trp Thr Trp Leu Ala Pro Leu Ala 
905 910 915 920 

gag gag gaa gat gac gtt gaa ttt tct ggt gaa gat aat gca eag cge 2838 
Glu Glu Glu Asp Asp Val Glu Phe Ser Gly Glu Asp Asn Ala Gin Arg 
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925 930 935 

ate aca caa cct gag cct gaa caa cag gcc tat gag etc cat cag gag 2886 
lie Thr Oln Pro Glu Pro Glu Gin Gin Ala Tyr Glu Leu His Gin Glu 
940 945 950 

aac aag aag ccc acg age cag aga gtt caa agt gtg gaa att gac atg 2934 
Asn Lys Lya Pro Thr Ser Gin Arg Val Gin Ser Val Glu lie Asp Met 
955 960 965 

ttc tot gaa gat gag cct cat ctg ace ata cag gat ccc cga aag aag 2982 
Phe Ser Glu Asp Glu Pro His Leu Thr He Gin Asp Pro Arg Lys Lys 
970 975 980 

tct gat gtt ace agt ata eta tea gaa tgt age aee att gat ctt cag 3030 
Ser Asp Val Thr Ser He Leu Ser Glu Cys Ser Thr He Asp Leu Gin 
985 990 995 1000 

gat ggc ttt gga tgg tta cct gag atg gtt ccc aaa aag caa cca gag 3078 
Asp Gly Phe Gly Trp Leu Pro Glu Met Val Pro Lys Lys Gin Pro Glu 
1005 1010 1015 

aga tgt ttg ccc aaa ggc ttt ggt tgc tgc ttt cca tgc tgt age gtg 3126 
Arg Cys Leu Pro Lys Gly Phe Gly Cys Cys Phe Pro Cys Cys Ser Val 
1020 1025 1030 

gac aag aga aag cct ccc tgg gte att tgg tgg aac ctg egg aaa acc 3174 
Asp Lys Arg Lys Pro Pro Trp Val He Trp Trp Asn Leu Arg Lys Thr 
1035 1040 1045 

tgc tac caa ata gtg aaa cac age tgg ttt gag age ttt att ate ttt 3222 
Cys Tyr Gin He Val Lys His Ser Trp Phe Glu Ser Phe He He Phe 
1050 1055 1060 

gtg att ctg ctg age agt ggg gca ctg ata ttt gaa gat gtt cac ctt 3270 
Val He Leu Leu Ser Ser Gly Ala Leu He Phe Glu Asp Val His Leu 
1065 1070 1075 1080 

gag aac caa ccc aaa ate caA gaa tta eta aat tgt act gac att att 3318 
Glu Asn Gin Pro Lys He Gin Glu Leu Leu Asn Cys Thr Asp He He 
1085 1090 1095 

ttt aca cat att ttt ate ctg gag atg gta eta aaa tgg gta gee ttc 3366 
Phe Thr His He Phe He Leu Glu Met Val Leu Lys Trp Val Ala Phe 
1100 1105 1110 

gga ttt gga aag tat ttc acc agt gee tgg tgc tgc ctt gat ttc ate 3414 
Gly Phe Gly Lys Tyr Phe Thr Ser Ala Trp Cys Cys Leu Asp Phe He 
1115 1120 1125 

att gtg att gtc tct gtg acc acc etc att aac tta atg gaa ttg aag 3462 
He Val He Val Ser Val Thr Thr Leu He Asn Leu Met Glu Leu Lys 
1130 1135 1140 

tec ttc egg act eta cga gca ctg agg cct ctt cgt gcg ctg tec cag 3510 
Ser Phe Arg Thr Leu Arg Ala Leu Arg Pro Leu Arg Ala Leu Ser Gin 
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1145 1150 1155 1160 

ttt gaa gga atg aag gtg gtg gtc aat get etc ata ggt gee ata act 3558 
Phe Glu Gly Met Lys Val Val Val Asn Ala Leu lie Gly Ala lie Pro 
1165 1170 1175 

gcc att Gtg aat gtt ttg ctt gtc tgc etc att ttc tgg etc gta ttt 3606 
Ala lie Leu Asn Val Leu Leu Val Cys Leu He Phe Trp Leu Val Phe 
1180 1185 1190 

tgt att ctg gga gta tac ttc ttt tct gga aaa ttt ggg aaa tgc att 3554 
Cys He Leu Gly Val Tyr Phe Phe Ser Gly Lys Phe Gly Lys Cys He 
1195 1200 1205 

aat gga aca gac tea gtt ata aat tat ace ate att aca aat aaa agt 3702 
Asn Gly Thr Asp Ser Val He Asn Tyr Thr He He Thr Asn Lys Ser 
1210 1215 1220 

caa tgt gaa agt gge aat ttc tct tgg ate aac cag aaa gtc aac ttt 3750 
Gin Cys Glu Ser Gly Asn Phe Ser Trp He Asn Gin Lys Val Asn Phe 
1225 1230 1235 1240 

gac aat gtg gga aat get tac etc get ctg ctg caa gtg gca aca ttt 3798 
Asp Asn Val Gly Asn Ala Tyr Leu Ala Leu Leu Gin Val Ala Thr Phe 
1245 1250 1255 

aag gge tgg atg gat att ata tat gca get gtt gat tec aca gag aaa 3845 
Lys Gly Trp Met Asp He He Tyr Ala Ala Val Asp Ser Thr Glu Lys 
1260 1265 1270 

gaa caa cag cca gag ttt gag age aat tea etc ggt tac att tac ttc 3894 
Glu Gin Gin Pro Glu Phe Glu Ser Asn Ser Leu Gly Tyr He Tyr Phe 
1275 1280 1285 

gta gtc ttt ate ate ttt gge tea ttc ttc act ctg aat etc ttc att 3942 
Val Val Phe He He Phe Gly Ser Phe Phe Thr Leu Asn Leu Phe He 
1290 1295 1300 

gge gtt ate att gac aac ttc aac caa cag cag aaa aag tta ggt gge 3990 
Gly Val He He Asp Asn Phe Asn Gin Gin Gin Lys Lys Leu Gly Gly 
1305 1310 1315 1320 

caa gac att ttt atg aca gaa gaa cag aag aaa tac tat aat gca atg 4038 
Gin Asp He Phe Met Thr Glu Glu Gin Lys Lys Tyr Tyr Asn Ala Met 
1325 1330 1335 

aaa aaa tta gga tec aaa aaa cct caa aaa ecc att cca egg cet ctg 4086 
Lys Lys Leu Gly Ser Lys Lys Pro Gin Lys Pro He Pro Arg Pro Leu 
1340 1345 1350 

aac aaa tgt caa ggt etc gtg ttc gac ata gtc aca age cag ate ttt 4134 
Asn Lys Cys Gin Gly Leu Val Phe Asp He Val Thr Ser Gin He Phe 
1355 1360 1365 

gac ate ate ate ata agt etc att ate eta aac atg att age atg atg 4182 
Asp He He He He Ser Leu He He Leu Asn Met He Ser Met Met 
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1370 1375 1380 

get gaa tea tac aac caa ccc aaa gcc atg aaa tec ate ctt gac cat 4230 
Ala Glu Ser Tyr Asn Gin Pro Lys Ala Met Lys Ser lie Leu Asp His 
1385 1390 1395 1400 

etc aac tgg gtc ttt gtg gtc ate ttt acg tta gaa tgt etc ate aaa 4278 
Leu Asn Trp Val Phe Val Val lie Phe Thr Leu Glu Cys Leu lie Lys 
1405 1410 1415 

ate ttt get ttg agg caa tac tac ttc ace aat gge tgg aat tta ttt 4326 
lie Phe Ala Leu Arg Gin Tyr Tyr Phe Thr Asn Gly Trp Asn he\x Phe 
1420 1425 1430 

gac tgt gtg gtc gtg ctt ctt tec att gtt agt aca atg att tct ace 4374 
Asp Cys Val Val Val Leu Leu Ser lie Val Ser Thr Met lie Ser Thr 
1435 1440 1445 

ttg gaa aat eag gag cae att ect ttc cet ccg acg etc tte aga att 4422 
Leu Glu Asn Gin Glu His lie Pro Phe Pro Pro Thr Leu Phe Arg lie 
1450 1455 1460 

gtc cgc ttg get egg att gge cga ate ctg agg ctt gtc egg get gca 4470 
Val Arg Leu Ala Arg He Gly Arg He Leu Arg Leu Val Arg Ala Ala 
1465 1470 1475 1480 

cga gga ate agg act etc etc ttt get ctg atg atg teg ctt cet tct 4518 
Arg Gly lie Arg Thr Leu Leu Phe Ala Leu Met Met Ser Leu Pro Ser 
1485 1490 1495 

ctg ttc aac att ggt ctt eta etc ctt ctg att atg ttt ate tat gcc 4566 
Leu Phe Asn He Gly Leu Leu Leu Phe Leu He Met Phe He Tyr Ala 
1500 1505 1510 

att ctg ggt atg aac tgg ttt tec aaa gtg aat eca gag tct gga ate 4614 
He Leu Gly Met Asn Trp Phe Ser Lys Val Asn Pro Glu Ser Gly He 
1515 1520 1525 

gat gac ata ttc aac ttc aag act ttt gcc age age atg etc tgt etc 4662 
Asp Asp He Phe Asn Phe Lys Thr Phe Ala Ser Ser Met Leu Cys Leu 
1530 1535 1540 

ttc cag ata age aca tea gca ggt tgg gat tec ctg etc age ccc atg 4710 
Phe Gin He Ser Thr Ser Ala Gly Trp Asp Ser Leu Leu Ser Pro Met 
1545 1550 1555 1560 

ctg cga tea aaa gaa tea tgt aac tct tec tea gaa aac tgc cac etc .4758 
Leu Arg Ser Lys Glu Ser Cys Asn Ser Ser Ser Glu Asn Cys His Leu 
1565 1570 1575 

cet gge ata gcc aca tec tac ttt gtc agt tac att ate ate tec ttt 4806 
Pro Gly He Ala Thr Ser Tyr Phe Val Ser Tyr He He He Ser Phe 
1580 1585 1590 

etc att gtt gtc aac atg tac att get gtg att tta gag aac ttc aat 4854 
Leu He Val Val Asn Met Tyr He Ala Val He Leu Glu Asn Phe Asn 
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1595 1600 1605 

aca gcc act gaa gaa agt gag gac cct ttg ggt gaa gat gac ttt gac 4902 
Thr Ala Thr Glu Glu Ser Glu Asp Pro Leu Gly Glu Asp Asp Phe Asp 
1610 1615 1620 

ata ttt tat gaa gtg tgg gaa aag ttt gac cca gaa gca aca caa ttt 4950 
lie Phe Tyr Glu Val Trp Glu Lys Phe Asp Pro Glu Ala Thr Gin Phe 
1625 1630 1635 1640 

ate aaa tat tct gcc ctt tct gac ttt get gat gcc ttg cct gag cct 4998 
lie Lys Tyr Ser Ala Leu Ser Asp Phe Ala Asp Ala Leu Pro Glu Pro 
1645 1650 1655 

ttg cgt gtc gca aag cca aat aaa tat caa ttt eta gta atg gac ttg 5046 
Leu Arg Val Ala Lys Pro Asn Lys Tyr Gin Phe Leu Val Met Asp Leu 
1660 1665 1670 

ccc atg gtg agt gaa gat cgc etc cac tgc atg gat att ctt ttc gcc 5094 
Pro Met val Ser Glu Asp Arg Leu His Cys Met Asp He Leu Phe Ala 
1675 1680 1685 

ttc ace get agg gta etc ggt ggc tct gat ggc eta gat agt atg aaa 5142 
Phe Thr Ala Arg Val Leu Gly Gly Ser Asp Gly Leu Asp Ser Met Lys 
1690 1695 1700 

gca atg atg gaa gag aag ttc atg gaa gcc aat cct etc aag aag ttg 5190 
Ala Met Met Glu Glu Lys Phe Met Glu Ala Asn Pro Leu Lys Lys Leu 
1705 1710 1715 1720 

tat gaa ccc ata gtc ace ace ace aag aga aag gaa gag gaa aga ggt 5238 
Tyr Glu Pro He Val Thr Thr Thr Lys Arg Lys Glu Glu Glu Arg Gly 
1725 1730 1735 

get get att att caa aag gcc ttt ega aag tac atg atg aag gtg ace 5286 
Ala Ala He He Gin Lys Ala Phe Arg Lys Tyr Met Met Lys Val Thr 
1740 1745 1750 

aag ggt gac caa ggt gac caa aat gac ttg gaa aac ggg cct cat tea 5334 
Lys Gly Asp Gin Gly Asp Gin Asn Asp Leu Glu Asn Gly Pro His Ser 
1755 1760 1765 

cca etc cag act ctt tgc aat gga gac ttg tct age ttt ggg gtg gee 5382 
Pro Leu Gin Thr Leu Cys Asn Gly Asp Leu Ser Ser Phe Gly Val Ala 
1770 1775 1780 

aag ggc aag gtc cac tgt gac tgagcectea cetceacgec tacctcatag 5433 
Lys Gly Lys Val His Cys Asp 
1785 1790 

cttcacagcc ttgccttcag ectetgagct ceaggggtea gcagcttagt gtatcaacag 5493 
ggagtggatt caecaaatta gecattccat tttcttttct ggctaaaata aatgatattt 5553 
caatttcatt ttaaatgata cttacagaga tataagataa ggctaettga caaceagtgg 5613 
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tactattata ataaggaaga agacaccagg aaggactgta aaaggacata ccaattttag 5673 

gattgaaata gttcaggccg ggcgcagtgg ctcatgcctg taatcccagc actttgagag 5732 

gccaaggcag gtggatcacg aggtcaagag atcgagacca tcctggccaa catgatgaaa 5793 

ctccgtctct ctaaaaatac aaaaattagc tgggcatggt ggcgtgcgcc tgtagtccca S853 

ctapttg 5860 



<210> 42 
<211> 1791 
<212> PRT 

<213> Homo sapiens 
<400> 42 

Met Asp Asp Arg Cys Tyr Pro Val lie Phe Pro Asp Glu Arg Asn Phe 
15 10 15 

Arg Pro Phe Thr Ser Asp Ser Leu Ala Ala lie Glu Lys Arg lie Ala 
20 25 30 

lie Gin Lys Glu Lys Lys Lys Ser Lys Asp Gin Thr Gly Glu Val Pro 
35 40 45 

Gin Pro Arg Pro Gin Leu Asp Leu Lys Ala Ser Arg Lys Leu Pro Lys 
50 55 60 

Leu Tyr Gly Asp lie Pro Arg Glu Leu lie Gly Lys Pro Leu Glu Asp 
65 70 75 80 

Leu Asp Pro Phe Tyr Arg Asn His Lys Thr Phe Met Val Leu Asn Arg 
85 90 95 

Lys Arg Thr He Tyr Arg Phe Ser Ala Lys His Ala Leu Phe He Phe 
100 105 110 

Gly Pro Phe Asn Ser He Arg Ser Leu Ala He Arg. Val Ser Val His 
115 120 125 

Ser Leu Phe Ser Met Phe He He Gly Thr Val He He Asn Cys Val 
130 135 140 

Phe Met Ala Thr Gly Pro Ala Lys Asn Ser Asn Ser Asn Asn Thr Asp 
145 150 155 160 

He Ala Glu Cys Val Phe Thr Gly He Tyr He Phe Glu Ala Leu He 
165 170 175 

Lys He Leu Ala Arg Gly Phe He Leu Asp Glu Phe Ser Phe Leu Arg 
180 185 190 

Asp Pro Trp Asn Trp Leu Asp Ser He Val He Gly He Ala He Val 
195 200 205 
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Ser Tyr He Pro Gly He Thr lie Lys Leu Leu Pro Leu Arg Thr Phe 
210 215 220 

Arg Val Phe Arg Ala Leu Lys Ala He Ser Val Val Ser Arg Leu Lys 
225 230 235 240 

Val He Val Gly Ala Leu Leu Arg Ser Val Lys Lys Leu Val Asn Val 
245 250 255 

He He Leu Thr Phe Phe Cys Leu Ser He Phe Ala Leu Val Gly Gin 
260 265 270 

Gin Leu Phe Met Gly Ser Leu Asn Leu Lys Cys He Ser Arg Asp Cys 
275 280 285 

Lys Asn He Ser Asn Pro Glu Ala Tyr Asp His Cys Phe Glu Lys Lys 
290 295 300 

Glu Asn Ser Pro Glu Phe Lys Met Cys Gly He Trp Met Gly Asn Ser 
305 310 315 320 

Ala cys Ser He Gin Tyr Glu Cys Lys His Thr Lys He Asn Pro Asp 
325 330 335 

Tyr Asn Tyr Thr Asn Phe Asp Asn Phe Gly Trp Ser Phe Leu Ala Met 
340 345 350 

Phe Arg Leu Met Thr Gin Asp Ser Trp Glu Lys Leu Tyr Gin Gin Thr 
355 360 365 

Leu Arg Thr Thr Gly Leu Tyr Ser Val Phe Phe Phe He Val Val He 
370 375 380 

Phe Leu Gly Ser Phe Tyr Leu He Asn Leu Thr Leu Ala Val Val Thr 
385 390 395 400 

Met Ala Tyr Glu Glu Gin Asn Lys Asn Val Ala Ala Glu He Glu Ala 
405 410 415 

Lys Glu Lys Met Phe Gin Glu Ala Gin Gin Leu Leu Lys Glu Glu Lys 
420 425 430 

Glu Ala Leu Val Ala Met Gly He Asp Arg Ser Ser Leu Thr Ser Leu 
435 440 445 

Glu Thr Ser Tyr Phe Thr Pro Lys Lys Arg Lys Leu Phe Gly Asn Lys 
450 455 460 

Lys Arg Lys Ser Phe Phe Leu Arg Glu Ser Gly Lys Asp Gin Pro Pro 
465 470 475 480 

Gly Ser Asp Ser Asp Glu Asp Cys Gin Lys Lys Pro Gin Leu Leu Glu 
485 490 495 

Gin Thr Lys Arg Leu Ser Gin Asn Leu Ser Leu Asp His Phe Asp Glu 
500 505 510 
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His Gly Asp Pro Leu Gin Arg Gin Arg Ala Leu Ser Ala Val Ser lie 
515 . 520 525 

Leu Thr lie Thr Met Lys Glu Gin Glu Lys Ser Gin Glu Pro Cys Leu 
530 535 540 

Pro Cys Gly Glu Asn Leu Ala Ser Lys Tyr Leu Val Trp Asn Cys Cys 
545 550 555 560 

Pro Gin Trp Leu Cys Val Lys Lys Val Leu Arg Thr Val Met Thr Asp 
565 570 575 

Pro Phe Thr Glu Leu Ala lie Thr lie Cys lie He He Asn Thr Val 
580 585 590 

Phe Leu Ala Met Glu His His Lys Met Glu Ala Ser Phe Glu Lys Met 
595 600 605 

Leu Asn He Gly Asn Leu Val Phe Thr Ser He Phe He Ala Glu Met 
610 615 620 

Cys Leu Lys He He Ala Leu Asp Pro Tyr His Tyr Phe Arg Arg Gly 
625 630 635 640 

Trp Asn He Phe Asp Ser He Val Ala Leu Leu Ser Phe Ala Asp Val 
645 650 655 

Met Asn Cys Val Leu Gin Lys Arg Ser Trp Pro Phe Leu Arg Ser Phe 
660 665 670 

Arg Val Leu Arg Val Phe Lys Leu Ala Lys Ser Trp Pro Thr Leu Asn 
675 680 685 

Thr Leu He Lys He He Gly Asn Ser Val Gly Ala Leu Gly Ser Leu 
690 695 700 

Thr Val Val Leu Val He Val He Phe He Phe Ser Val Val Gly Met 
705 710 7X5 720 

Gin Leu Phe Gly Arg Ser Phe Asn Ser Gin Lys Ser Pro Lys Leu Cys 
725 730 735 

Asn Pro Thr Gly Pro Thr Val Ser Cys Leu Arg His Trp His Met Gly 
740 745 750 

Asp Phe Trp His Ser Phe Leu Val Val Phe Arg He Leu Cys Gly Glu 
755 760 765 

Trp He Glu Asn Met Trp Glu Cys Met Gin Glu Ala Asn Ala Ser Ser 
770 775 780 

Ser Leu Cys Val He Val Phe He Leu He Thr Val He Gly Lys Leu 
785 790 795 800 

Val Val Leu Asn Leu Phe He Ala Leu Leu Leu Asn Ser Phe Ser Asn 
805 810 815 
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Qlu Glu Arg Aan Gly Asn Leu Glu Gly Glu Ala Arg Lys Thr Lys Val 
820 825 830 

Gin Leu Ala Leu Asp Arg Phe Arg Arg Ala Phe Cys Phe Val Arg His 
835 840 845 

Thr Leu Glu His Phe Cys His Lys Trp Cys Arg Lys Gin Asn Leu Pro 
850 855 860 

Gin Gin Lys Glu Val Ala Gly Gly Cys Ala Ala Gin Ser Lys Asp lie 
865 870 875 880 

He Pro Leu Val Met Glu Met Lys Arg Gly Ser Glu Thr Gin Glu Glu 
885 890 895 

Leu Gly He Leu Thr Ser Val Pro Lys Thr Leu Gly Val Arg His Asp 
900 905 9X0 

Tro Thr Trp Leu Ala Pro Leu Ala Glu Glu Glu Asp Asp Val Glu Phe 
915 920 925 

Ser Gly Glu Asp Asn Ala Gin Arg He Thr Gin Pro Glu Pro Glu Gin 
930 935 940 

Gin Ala Tyr Glu Leu His Gin Glu Asn Lys Lys Pro Thr Ser Gin Arg 
945 950 955 960 

Val Gin Ser Val Glu He Asp Met Phe Ser Glu Asp Glu Pro His Leu 
965 970 975 

Thr He Gin Asp Pro Arg Lys Lys Ser Asp Val Thr Ser He Leu Ser 
980 985 990 

Glu Cys ser Thr He Asp Leu Gin Asp Gly Phe Gly Trp Leu Pro Glu 
995 1000 1005 

Met Val Pro Lys Lys Gin Pro Glu Arg Cys Leu Pro Lys Gly Phe Gly 
1010 1015 1020 

CYS Cys Phe Pro Cys Cys Ser Val Asp Lys Arg Lys Pro Pro Trp Val 
1^25 1030 1035 1040 

He Trp Trp Asn Leu Arg Lys Thr Cys Tyr Gin He Val Lys His Ser 
1045 1050 1055 

Trp Phe Glu Ser Phe He He Phe val He Leu Leu Ser Ser Gly Ala 
1060 1065 1070 

Leu He Phe Glu Asp Val His Leu Glu Asn Gin Pro Lys He Gin Glu 
1075 1080 1085 

Leu Leu Asn Cys Thr Asp He He Phe Thr His He Phe He Leu Glu 
1090 1095 1100 

Met val Leu Lys Trp Val Ala Phe Gly Phe Gly Lys Tyr Phe Thr Ser 
1105 1110 1115 
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Ala Trp Cys Cys Leu Asp Phe He He Val He Val Ser Val Thr Thr 
1125 1130 113S 

Leu He Asn Leu Met Glu Leu Lya Ser Phe Arg Thr Leu Arg Ala Leu 
1140 1145 1150 

Arg Pro Leu Arg Ala Leu Ser Gin Phe Glu Gly Met Lys Val Val Val 
1155 1160 1165 

Asn Ala Leu He Gly Ala He Pro Ala He Leu Asn Val Leu Leu Val 
1170 1175 1180 

Cys Leu He Phe Trp Leu Val Phe Cys He Leu Gly Val Tyr Phe Phe 
lies 1190 1200 

Ser Gly Lys Phe Gly Lys Cys He Asn Gly Thr Asp Ser Val He Asn 
1205 1210 "1.5 

Tyr Thr He He Thr Asn Lys Ser Gin Cys Glu Ser Gly Asn Phe Ser 
1220 1225 1230 

Trp He Asn Gin Lys Val Asn Phe Asp Asn Val Gly Asn Ala Tyr Leu 
1235 1240 1245 

Ala Leu Leu Gin Val Ala Thr Phe Lys Gly Trp Met Asp He He Tyr 
1250 1255 1260 

Ala Ala Val Asp Ser Thr Glu Lys Glu Gin Gin Pro Glu Phe Glu Ser 
1265 1270 1275 1280 

Asn ser Leu Gly Tyr He Tyr Phe Val Val Phe He He Phe Gly Ser 
1285 1290 1295 

Phe Phe Thr Leu Asn Leu Phe He Gly Val He He Asp Asn Phe Asn 
1300 1305 1310 

Gin Gin Gin Lys Lys Leu Gly Gly Gin Asp He Phe Met Thr Glu Glu 
1315 1320 1325 

Gin Lys Lys Tyr Tyr Asn Ala Met Lys Lys Leu Gly Ser Lys Lys Pro 
1330 1335 1340 

Gin Lys Pro He Pro Arg Pro Leu Asn Lys Cys Gin Gly Leu Val Phe 
1345 1350 1355 "60 

ASP He val Thr Ser Gin He Phe Asp He He He He Ser Leu He 
1365 1370 1375 

He Leu Asn Met He Ser Met Met Ala Glu Ser Tyr Asn Gin Pro Lys 
1380 1385 1390 

Ala Met Lys Ser He Leu Asp His Leu Asn Trp Val Phe Val Val He 
1395 1400 1405 

Phe Thr Leu Glu Cys Leu He Lys He Phe Ala Leu Arg Gin Tyr Tyr 
1410 1415 1420 

71 



wo 01/05831 



PCT/USOO/19342 



Phe Thr Asn Gly Trp Asn Leu Phe Asp Cys Val Val Val Leu Leu Ser 
1425 1430 1435 1440 

lie Val Ser Thr Met lie Ser Thr Leu Glu Aszi Gin Glu His lie Pro 
1445 1450 1455 

Phe Pro Pro Thr Leu Phe Arg He Val Arg Leu Ala Arg He. Gly Arg 
1460 1465 1470 

He Leu Arg Leu Val Arg Ala Ala Arg Gly He Arg Thr Leu Leu Phe 
1475 1480 1485 

Ala Leu Met Met Ser Leu Pro Ser Leu Phe Asn He Qly Leu Leu Leu 
1490 1495 1500 

Phe Leu He Met Phe He Tyr Ala He Leu Gly Met Asn Trp Phe Ser 
1505 1510 1515 1520 

Lys Val Asn Pro Glu Ser Gly He Asp Asp He Phe Asn Phe Lys Thr 
1525 1530 1535 

Phe Ala Ser Ser Met Leu Cys Leu Phe Gin He Ser Thr Ser Ala Gly 
1540 1545 1550 

Trp Asp Ser Leu Leu Ser Pro Met Leu Arg Ser Lys Glu Ser Cys Asn 
1555 1560 1565 

Ser Ser Ser Glu Asn Cys His Leu Pro Gly He Ala Thr Ser Tyr Phe 
1570 1575 1580 

Val Ser Tyr He He He Ser Phe Leu He Val Val Asn Met Tyr He 
1585 1590 1595 1600 

Ala Val He Leu Glu Asn Phe Asn Thr Ala Thr Glu Glu Ser Glu Asp 
1605 1610 1615 

Pro Leu Gly Glu Asp Asp Phe Asp He Phe Tyr Glu Val Trp Glu Lys 
1620 1625 1630 

Phe Asp Pro Glu Ala Thr Gin Phe He Lys Tyr Ser Ala Leu Ser Asp 
1635 1640 1645 

Phe Ala Asp Ala Leu Pro Glu Pro Leu Arg Val Ala Lys Pro Asn Lys 
1650 1655 1660 

Tyr Gin Phe Leu Val Met Asp Leu Pro Met Val Ser Glu Asp Arg Leu 
1665 1670 1675 1680 

His Cys Met Asp He Leu Phe Ala Phe Thr Ala Arg Val Leu Gly Gly 
1685 1690 1695 

Ser Asp Gly Leu Asp Ser Met Lys Ala Met Met Glu Glu Lys Phe Met 
1700 1705 1710 

Glu Ala Asn Pro Leu Lys Lys Leu Tyr Glu Pro He Val Thr Thr Thr 
1715 1720 1725 
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Lys Lys Glu 01« Glu Arg Gly Ala Ala He lie Gin Lys Ala Phe 
1730 ^'^^5 



;^ .ys Tyr Met Met Lys Val Thr Lys Gly^Asp Gin Gly Asp Gln^Asn 
1745 ^''5° 

Olu OX, Ki. =e. ,.o >^ Oln «u or. aly 

1765 

.eu Ser Ser Phe Gly Val Ala Lys Gly Lys Val His^Cys Asp 



1780 ^'^^^ 



<210> 43 
<211> 8 
<212> PRT 

<213> Artificial Sequence 



'AIT> Description of Artificial Sequence: protein seq. 
basis for rat NaN reverse primer no. 5 

<400> 43 

Ala Trp Cys Trp Leu Asp Phe Leu 
1 5 



<210> 44 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> human NaN reverse primer 

<400> 44 23 
gtgccgtaaa catgagactg teg 
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